| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Department of Medicinal Chemistry and Molecular Pharmacology, Markey Center for Structural Biology, Purdue Cancer Center, Purdue University, West Lafayette, Indiana
Correspondence: Address reprint requests to C. B. Post, E-mail: cbp{at}purdue.edu.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
In this article, we devise a method to estimate the compressibility of hydration water as a function of the varied chemical nature of the protein-water interface. A description of hydration based on the protein surface composition is attractive because it reflects the chemical nature of the protein-water interface (13
). The average solvent-exposed atomic-type composition estimated for a set of 16 globular proteins includes not only charged (12.2%) and polar (37.4%) atoms, but also a substantial number of apolar atoms (50.4%) (14
). One argument in favor of an approach based on surface composition is the fact that when only a limited number of water molecules is available, the molecules do not distribute uniformly on the protein solvent-accessible surface. Molecular dynamics simulations have found that 350 water molecules that fully hydrate myoglobin, originally distributed uniformly on the protein surface, arrange in such a way that they hydrate every charged group and cover 83% of the charged surface, 65% of the hydrophilic surface, and only 53% of the hydrophobic surface (15
).
Protein-water interactions are fundamental to the description of protein physical properties (1
,16
,17
). The hydration properties of water play an important role in the formation of the three-dimensional native structures of proteins and nucleic acids, and the dynamics of biological molecules (18
). Proteins do not perform their functions until a critical hydration level is achieved, and in some cases, the level of hydration that is required to restore enzymatic activity is less than one layer of water molecules (19
). A detailed description of the structure and properties of water in the protein hydration shell is essential for our understanding of the processes taking place in the interior of a biological cell, a gel-like matrix (20
,21
) where most water molecules are within a few solvation layers of proteins and other macromolecules.
There has been extensive work through both experimental (22
26
) and computer simulations methods (27
32
) to explain how solutes, and proteins in particular, change the properties of water layers adjacent to their surface. Theories of hydrophobic hydration (33
36
) relate the structural ordering of water by the nonpolar solutes to the hydration thermodynamics through solute-water pair correlations. The special properties of water molecules in the hydration shell of proteins are reflected in the compressibility and density, when compared to bulk water. Others have estimated the compressibility of hydration water (5
,37
42
) to be, on average, 22% lower than bulk water compressibility. Data from small-angle x-ray scattering combined with neutron scattering studies of lysozyme, thioredoxin reductase, and protein R1 (24
) show that the average density of the water in the protein first hydration shell is 10% higher than the density of bulk water, a result that is consistent with experimental findings from x-ray crystallography (43
). The estimated values for the hydration water compressibility and density are consistent with each other in that higher density restricts density fluctuations and therefore lowers compressibility.
We report here a model for hydration water compressibility derived from analysis of molecular dynamics (MD) simulations of protein solutions. The approach for estimating protein compressibility from volume fluctuations was established previously based on MD simulations for four globular proteins: trypsin, ribonuclease A, hen egg white (HEW) lysozyme, and
-lactalbumin (44
). In our model, the residual compressibility of the water in the protein hydration shell is accounted for by a linear combination of contributions from charged, polar, and apolar solvent-accessible surfaces. The physical basis for the hydration water compressibility model is the variation in water density indicated by surface-specific radial distribution functions (RDFs) of water around charged, polar, and apolar protein surfaces. The outcome is a formula to interpret experimental data on protein solution compressibility by deconvolution of intrinsic protein and hydration contribution when a protein of known structure is investigated. Further, we compare the hydration water compressibility for solvation of proteins with that of small peptides to assess whether hydration water compressibility determined from small compound data can be used to estimate the compressibility of hydration water surrounding proteins. The results indicate that small compound data cannot be directly used to estimate the compressibility of hydration water surrounding proteins.
| METHODS |
|---|
|
|
|---|
, is defined as the relative change in the solution volume, Vsol, under pressure (P) at constant temperature (T):
![]() | (1) |
The total volume is a sum of contributions from the protein molecular volume, Vp, the volume of the bulk water, Vw, and the volume of hydration water, Vh, as follows:
![]() |
Equation 1 can be rewritten as a sum over component volume contributions,
![]() | (2) |
![]() | (3) |
The ratios between the solution volume components and the total solution volume are the volume fractions:
,
, and
. In Eq. 3, we identify the protein compressibility,
![]() |
![]() |
![]() |
Finally, Eq. 2 for the compressibility of the protein solution becomes
![]() | (4) |
The hydration contribution to solution compressibility is estimated assuming that all water in the protein solution has the properties of bulk water, and then compare this estimate with the actual
(Eq. 4). In this procedure,
and the ideal solution compressibility is
![]() | (5) |
Any difference between the actual (Eq. 4) and the ideal (Eq. 5) solution compressibility,
![]() | (6) |
is the difference between the compressibility of hydration and bulk water, respectively.
For the experimental determination of protein compressibility, the adiabatic compressibility of a solution is given by the Laplace equation,
, where
is the density of the solution and u is the velocity of sound through the solution. The experimental, apparent partial adiabatic compressibility of a protein, is defined as the limit at 0 protein concentration of the following expression (38
):
![]() | (7) |
is the adiabatic solution compressibility,
is the compressibility of water,
is the partial specific volume of the protein, and
is the apparent volume fraction of the solvent in solution. The apparent protein compressibility has contributions from both intrinsic protein compressibility and a hydration contribution.
The adiabatic compressibility is related to isothermal compressibility (5
):
![]() | (8) |
and
0, and the heat capacity of water and protein solutions, CP and CP0, respectively.
The following relationship (45
,46
) may be used to decompose the apparent experimental protein compressibility into an intrinsic protein compressibility,
, and a hydration water contribution:
![]() | (9) |
The hydration contribution reflects the change in the compressibility of the solvent as a result of interactions with the surface-exposed protein atoms. Finally, protein intrinsic compressibility,
, can be calculated as
![]() | (10) |
In this article, we outline a procedure for straightforward determination of the protein intrinsic compressibility,
, from the apparent compressibility determined experimentally,
. The central idea of the procedure is the evaluation of
for any protein of known three-dimensional structure.
Computational methods
Simulations
MD simulations of trypsin, ribonuclease A, lysozyme, and
-lactalbumin were performed for 1 ns in an isothermal-isobaric (NPT) ensemble (constant number of particles, N, constant pressure, P = 1 atm, and constant temperature, T = 300 K), with periodic boundary conditions and the particle-mesh Ewald method (47
) for the calculations of the electrostatic interactions. Constant temperature and pressure were maintained using the Nosé-Hoover method of coupling to a heat bath and extended system algorithm for controlling the pressure, with a coupling constant of 25 ps. The computer simulations and post-processing of the generated trajectories are performed with CHARMM (48
). More details of the MD simulations for the four globular proteins are described elsewhere (44
). MD simulations were also performed for four small dipeptides: alanine dipeptide, glutamic acid dipeptide, aspartic acid dipeptide, and lysine dipeptide. Each CH3-blocked dipeptide was solvated in a cubic box of 1000 TIP3P water molecules and MD simulations were performed using the same protocol as for proteins (details of the simulations have appeared (49
)).
Compressibility calculation
Isothermal compressibility, ßT, for proteins, protein and dipeptide solutions, and pure water (TIP3P model), is determined from volume fluctuations,
V2, and average volumes,
V
, according to
![]() | (11) |
The protein molecular volume, V(t), is determined using a grid-based, atomic volume expansion algorithm, that allows for the protein interstitial spaces to be included in the total volume together with the protein van der Waals volume (44
). Protein volumes are evaluated every ps from a 1-ns equilibrium trajectory with a 0.2 Å grid spacing. The average and standard deviations of the volume distribution are then used to calculate protein intrinsic compressibility,
. The isothermal compressibility evaluated in this manner correlates well with the experimental compressibility. As pointed out in a recent review (7
), the suitability of any method for protein volume calculation is validated by the correlation with experiment. The compressibility of the protein and dipeptide solutions is also determined from Eq. 11, using the histogram of the time series of the whole system (simulation box) volume, Vsol(t), rather than molecular volumes.
Errors due to finite grid spacing
We have calculated protein total volume using a grid spacing,
= 0.2 Å. An upper bound for the errors introduced in the volume calculation by the finite grid spacing can be obtained as follows. Let us assume that our protein is spherical, with a radius of 15 Å. An estimate of the error in volume calculation is given by the ratio of the spherical shell volume at the protein surface with a width of
R =
, 4
R2
, and the volume of the whole sphere, 4
R3/3. For our particular case, this ratio is 3 x 0.2/15 = 0.04, i.e., 4%. Following the rationale for error calculation described earlier (44
), the error in compressibility calculation due to the calculation of volume with a finite grid spacing formula is 4%. This is a maximal error that assumes that all the protein surface is convex, such as the case with a spherical protein. In fact, the protein surface has both convex and concave local geometry (peaks and valleys) and some of the errors (overestimates and underestimates) will cancel out. The total error in the calculation of the total protein volume with a grid spacing of 0.2 Å for a globular protein with a radius of gyration of 15 Å is smaller than 4%.
Atomic categories
To calculate the charged, polar, and apolar contributions to
, surface-exposed protein atoms are placed in three categories: charged (crg); polar (pol); or apolar (apl). Charged atoms include the side-chain carboxyl groups of Asp and Glu, the guanidinium group of Arg, the side-chain amino group of Lys, the amino and carboxylate termini of the polypeptide chain, and the charged Ca2+ ions in
-lactalbumin and trypsin. The polar category includes atoms with an absolute partial charge |qi| > 0.3 e, excluding those atoms defined as charged. All other atoms are apolar atoms. The solvent-accessible surface areas contributed by charged, polar, and apolar atoms (SAStype, type = crg, plr, aplr), as well as the total SAS are calculated with CHARMM with a probe radius of 1.4 Å (based on the algorithm of Lee and Richards (50
)).
Radial distribution functions
The radial distribution function (RDF), gAB(R), of atom A, at a distance R from a central atom, B, can be calculated from the normalized histogram of the numbers of atoms, nA(R + 0.5
R), in a bin of width
R:
![]() | (12) |
We calculate the
of water oxygen atoms surrounding charged, polar, and apolar groups using CHARMM. The space around each solvent-exposed protein (heavy) atom is divided into concentric spherical shells with a width of 0.1 Å for a total of 100 shells, with R ranging from 0 to 10 Å from the center of each heavy atom on the protein surface. The oxygen-oxygen gOO(R) values in ambient water are calculated from a 20-ns simulation of 2000 water molecules at constant pressure and temperature with periodic boundary conditions and the particle-mesh Ewald (47
) summation for the electrostatic interactions. The protein-water RDFs do not converge to 1; instead, they converge to
0.62. This is because the volume occupied by the protein is not accessible to the solvent. We renormalized the protein-water RDFs (i.e., multiplied the original curves by 1.61) to converge to 1.0 for easier comparison with gOO(R) of pure water-TIP3 model.
| RESULTS |
|---|
|
|
|---|
h.
Interface-specific RDFs
The interface-specific, cumulative RDF, gc(R), is a sum over all same-type (charged, polar, or apolar) surface-exposed atomic RDFs:
![]() | (13) |
SAS
larger than 10.0 Å2, to allow full contact with one water molecule.
Differences in the cumulative RDFs around charged (red curves), polar (green curves), and apolar (black curves) atoms are shown for trypsin (Fig. 1 A) and
-lactalbumin (Fig. 1 B). The sum of the converged values of the cumulative RDFs is proportional to the total number of heavy atoms that are solvent-exposed in each globular protein. Radial distribution functions, g(R), for the three types of interface are obtained from the normalized cumulative RDFs by scaling with the trajectory-averaged numbers of atoms of each type that are surface-exposed,
. Three distinct g(R) values, each representing the hydration of a particular type of protein-water interface, are obtained in Fig. 1, C (trypsin) and D (
-lactalbumin).
|
In liquid water under ambient conditions, each water molecule is surrounded on average by 3.8 closest neighbors. At the protein-water interface, approximately half of the configurational space is available to water, which means that each heavy atom should be in the proximity of
1.9 water molecules. We calculate the average number of hydration waters in the protein first hydration shell (to a radial distance of 3.2 Å from the protein atoms), and find that there are 2.7 water molecules hydrating charged groups, 1.5 water molecules for polar groups, and 1.1 for apolar groups.
The results presented here provide a physical basis for the decomposition of the protein hydration volume into contributions from charged, polar, and apolar hydration volumes. No discernable differences in the protein-water RDFs and bulk water are detected beyond 8 Å from the protein surface. MD simulations of proteins typically include 23 layers of water molecules separating the protein from the wall of the simulation box. As such, the majority of the water in the simulation box can be approximately characterized as hydration water.
Local environment
At the protein-water interface, the individual atomic group-specific RDFs are influenced by two factors of the local environment: the unique configuration of neighboring amino acids, and the local topology of the solvent-accessible surface, which could be convex or concave. Effects of the local environment are manifested in the differences on the RDFs for the same chemical group from individual residues. The RDFs of same-type atomic group vary somewhat in magnitude but the structural distinctions between atomic types apparent in the RDFs in Fig. 1 are consistently manifested for individual residues (results given in Supplementary Material). Difference in positions of the first hydration shell are due partly to local surface topology. That is, the maximum position at smaller distances for charged groups reflects the convex surface where these residues are generally located, while the shift toward larger distances is consistent with a concave surface for apolar groups.
Model: hydration shell compressibility as a function of the chemical nature of the protein-water interface
To assess the change in compressibility due to protein hydration we formulate a simple model where the compressibility of the hydration water,
, has contributions dependent on the nature of the protein surface exposed to water: charged, polar, and apolar. To estimate these contributions, we first identify the hydration contribution to the solution compressibility as the residual compressibility,
(Eq. 6)the difference between the actual isothermal compressibility of the protein-water solution (
estimated from simulation)and an ideal protein-water solution compressibility, which does not take into account changes in the solvent compressibility due to protein solvation,
. If water and the protein molecule are assumed to be independent components of the protein-water system,
can be estimated from the intrinsic protein compressibility,
, and the compressibility of the bulk water,
, weighted by their corresponding volume fractions,
p and
h +
w, according to Eq. 5.
The actual isothermal compressibility of the protein-water solutions and bulk water systems are calculated from the whole simulation box average volume,
Vsol
, and fluctuations,
, according to Eq. 11. The calculated protein-water solution compressibility,
(Table 1), is between 52 and 55 106 atm1. The compressibility of bulk water (TIP3P model) calculated from whole system volume fluctuations is 61 106 atm1 (44
,51
). Intrinsic, isothermal protein compressibility,
, is calculated from protein volume fluctuations,
V2, and average volumes,
V
, as described previously (44
). The values
and
are listed in Table 1.
|
range from 2.7 x 106 atm1 for trypsin to 0.83 x 106 atm1 for HEW lysozyme (Table 1). Such a range in
implies a nonuniform contribution from the protein surface that could arise from differences in the hydration of surface groups with different polarity.
We use a simple, linear model to decompose
(Eq. 6), into contributions from changes in the hydration water compressibility due to charged, polar, and apolar surfaces. The volume of the hydration water is decomposed into the volume near charged surfaces, Vh-crg, the volume near polar surfaces, Vh-plr, and the volume near apolar surfaces, Vh-aplr. The corresponding volume fractions are
h-crg = Vh-crg/Vh,
h-plr = Vh-plr/Vh, and
h-aplr = Vh-aplr/Vh. In a first approximation, the hydration volume fractions can be replaced by the respective surface fractions as
, and
. With this approximation,
becomes
![]() | (14) |
are calculated from
and
values (Table 1) by using a least-squares maximum likelihood method,
,
, and
(in units of 106 atm1; see Table 2). The compressibility of hydration water surrounding charged, polar, and apolar surfaces are the values
,
, and
. The estimates for
reproduce the original
with <10% error (last row in Table 1).
|
|
|
![]() | (15) |
The value v0 is the average volume of one water molecule in the hydration shell and
and
nh
are the fluctuations and averages of the number of hydration water molecules in the first hydration shell, nh. The value nh(t) is a time series of the number of water molecules within a shell of 3.1 Å surrounding the protein.
The average volume of a water molecule in the first hydration layer is estimated using the average solvent-accessible surface area, the width of the layer, and the average number of hydration waters: v0 =
SAS
x 3.1/
nh
. The average of v0 for the four globular proteins is 23 Å3. This value is smaller than the average volume of a water molecule in bulk, 29.7 Å3, but close to the volume of water molecules hydrating proteins obtained from high resolution crystal structures of 22 globular proteins (52
), 23.8 Å3.
We calculated
for trypsin and HEW lysozyme from
nh
= 676 for trypsin and
nh
= 456 for HEW lysozyme, the fluctuations in the number of water molecules in the first hydration shell,
(trypsin) and 5.84 (HEW lysozyme), and the average volume of one water molecule in the hydration shell, v0. With these values, the calculated compressibility of the first hydration layer is 47.2 106 atm1 for trypsin and 41.6 106 atm1 for HEW lysozyme. The calculated hydration compressibility values for the first solvation layer are 77% of bulk water compressibility for trypsin and 68% of bulk water compressibility for HEW-lysozyme. The lower compressibility of the first hydration layer of HEW lysozyme relative to trypsin is consistent with the model (Eq. 14), given the difference in the charged solvent-accessible surface areas exposed by the two proteins to water. However, this is only an approximation for the compressibility of the first hydration shell and does not represent the whole hydration water compressibility. Equation 15 is strictly valid for compressibility calculation in an NVT ensemble.
Application to experiment: a formula for obtaining intrinsic protein compressibility from the experimentally determined compressibility
We can now outline a simple algorithm for the calculation of protein intrinsic isothermal compressibility,
from the experimental isothermal protein compressibility,
. First, one must estimate
from the charged, polar, apolar, and total SAS of the three-dimensional structure of the protein. The value
is then calculated from
and
using Eq. 14. Next, protein intrinsic compressibility,
can be calculated from Eq. 9 by subtracting
from the experimental isothermal compressibility.
The algorithm can be used to decompose the experimental compressibility for any protein of known three-dimensional structure. For the application, it is necessary to reconcile the difference in the calculated and experimental isothermal compressibility of bulk water. Bulk water compressibility estimated from sound velocity measurements is 45 x 106 atm1 in ambient conditions (53
) while the calculated compressibility for pure water, the TIP3P model, is 61 x 106 atm1. The calculated
must be rescaled by a factor of 45:61 to be compatible with the experiment (Table 2).
The use of Eq. 9 for the calculation of the intrinsic protein compressibility,
, requires an estimate of the ratio between the hydration and protein volume fractions,
h/
p. Here we use a simple approximation dictated by the convergence of the RDFs around protein molecules: no differences in the RDFs between the bulk water and hydration water can be observed at distances, D, larger than 8 Å from the protein surface (Figs. 1 and 2). For a globular protein with a radius of gyration, Rgyr, the ratio between the hydration and protein volumes can be approximated by
![]() |
h/
p ratio is 2.6.
NMR absorption equilibrium experiments (54
) have found that for HEW-lysozyme, the onset of drastic decrease in relaxation rate occurs at 1.7 g of water for 1 g of protein. If the water density is 1 g/cm3 and protein density is 1.43 g/cm3,
h/
p = 1.7 x 1.43 = 2.43. The ratio of
h/
p obtained using the guidelines provided by the RDFs is in good agreement with estimates from NMR experiments. Both methods estimate that hydration of a regular protein such as lysozyme requires a hydration water volume that is approximately two-and-a-half times larger than the protein volume.
Examples
The method is illustrated using three proteins of known structure: lysozyme,
-lactalbumin, and myoglobin. The value
have been estimated by Gekko and Hasegawa using Eq. 8 (38
). The
values are estimated here using the adjusted (to experiment)
from Table 2 and the corresponding fractions of protein-water interfaces,
. To obtain the intrinsic protein compressibility we estimate the corrections due to protein hydration using the ratio of
h/
p = 2.43. This leads to intrinsic compressibility corrections ranging from 1.13 106 atm1 for
-lactalbumin and 4.37 106 atm1 for myoglobin. Final protein intrinsic compressibilities are listed in Table 3.
|
-lactalbumin, are part of the set of proteins for which the intrinsic isothermal compressibility was calculated from volume fluctuations in MD simulations (44
(
V2) (Table 3) (44
-lactalbumin. For myoglobin, our method predicts a large contribution to the intrinsic isothermal compressibility from hydration, 4.4 106 atm1. This estimate remains to be verified through other methods. The match between the intrinsic protein compressibility calculated from molecular volume fluctuations and from estimates of hydration contributions to protein compressibility provides a first validation of the theoretical model for protein hydration compressibility.
Hydration water compressibility around small compounds
Others have discussed the possibility that the hydration characteristics of atomic groups at the protein surface can be modeled from hydration properties of similar groups in small molecules. It was proposed that protein hydration water compressibility can be estimated from the hydration water compressibility of small compounds (39
,55
,56
), but the idea was challenged on the basis that the environment of basic groups in proteins is different from that of small molecules in solution or, alternatively, because the charged groups in proteins may not be as exposed to the solvent as those in small molecules ((7
) and references therein). The calculation of hydration water compressibility from MD simulations offers the opportunity for a direct evaluation of this proposal.
We determine the solution compressibility of four dipeptides in water,
, from whole system volume fluctuations and the predicted solution compressibility,
, based on volume fraction of water and peptide solute. Values for
SAS
crg,
SAS
plr,
SAS
aplr, and total
SAS
, their respective fractions, as well as peptide and simulation box volumes for small compounds, are presented in Table 4. The intrinsic compressibility is very small for peptides because of the insignificant amount of interstitial space shielded from water, the major source of intrinsic compressibility (44
), and the negligible compressibility of the covalent bonds (4
). The
and other values calculated from the simulations are shown in Table 4 for four dipeptides: alanine, glutamic, aspartic, and lysine dipeptides. For all four compounds, the dipeptide volume fraction,
p, is small,
2% of the total solution volume and
w is 98%. The term
0.0 and the main contributions to the predicted solution compressibility are from water,
.
for the four small compounds are listed in Table 4.
The
was estimated for the four dipeptides using
and
shown in Table 4, and
h = 0.5. The optimized values (Table 2) for
from the dipeptides for our small compounds are as follows:
,
, and
(in units of 106 atm1). After normalization with experimental isothermal compressibility of bulk water,
,
, and
, and the isothermal compressibilities of water near peptide surfaces are
,
, and
. For small compounds, the compressibility of the hydration water surrounding all types of atomic groups is lower than the compressibility of bulk water. The ranking among types of hydration water compressibility is the same as that for proteins in that
. The estimated values for hydration water compressibility around the dipeptides studied here reproduce
with <5% error (calculated errors are listed in Table 4).
The
estimates for the dipeptides agree qualitatively with various experimental estimates for small compounds (4
,57
). It was found that at 25°C the charged amino
and carboxyl-COO contribute have large negative partial molar adiabatic compressibility, 34 cm3 mol1 atm1. By contrast, for uncharged species, the partial molar adiabatic compressibility could be either positive (2.74 cm3 mol1 atm1 for glycolamide and 1 cm3 mol1 atm1 for the peptide CONH group) or negative (2 cm3 mol1 atm1 for the OH groups in pentoses and hexoses, and 12.5 cm3 mol1 atm1 for ribose).
Hydration properties of proteins and peptides
A comparison between the changes in hydration water compressibility around proteins and dipeptides reveals that the charged groups, and to a lesser extent the polar groups, are hydrated similarly in proteins and dipeptides, both exhibiting a decrease in the hydration water compressibility with respect to bulk water. Apolar groups have opposite sign contributions in proteins and dipeptides, with an increase in hydration water compressibility around apolar groups in proteins.
The two sets of
(Table 2) cannot be transferred between proteins and peptides. We have calculated errors in
for proteins using the values
from dipeptides and, inversely, we attempted to reproduce the
for dipeptides using the
for proteins. Errors larger than 20% were obtained in all trials, with particularly large errors in the
of proteins when using
from dipeptides. One possible explanation for the differences in the compressibility of hydration contributions of same surface types in proteins and small compounds is the intrinsic differences in the size of the molecules (which translates into a significantly large difference of the curvature of the solute-water interface) and the particular local arrangements of adjacent charged, polar, and apolar surfaces in proteins. Based on the results presented here, we conclude that the hydration properties of native globular proteins cannot be modeled using small molecule data. Therefore, even though the charged, polar, and apolar contributions to hydration compressibility for small compounds are ranked the same way as those in proteins (
), their magnitude is different for proteins and small compounds and the results are not interchangeable.
RDFs of water molecules around proteins and small dipeptides
The perturbation of the water structure around small molecules is compared to that around proteins by consideration of the RDFs in Fig. 2. While several common features exist, significant differences are also observed. One characteristic aspect of the small molecule RDFs is that the structuring of water extends to 1012 Å from each given atom in the dipeptide. For proteins, there is no noticeable water structuring extending beyond 810 Å from the protein surface. One possible explanation is that for proteins, long-range structuring of water is averaged out by the overlapping hydration shells originating in the adjacent charged, polar, and apolar surfaces.
The main differences between the RDFs from apolar surfaces (Fig. 2 A) in proteins (maroon curve) and dipeptides (violet curve) are the two shoulders at short (2.7 Å) and intermediate (4.8 Å) distance in the RDF of the apolar surface of proteins. The scaled (by 1:2) RDF from the charged protein surfaces, the red (dotted) curve in panel A, is also shown in Fig. 2, to indicate that the positions of the shoulders in the RDFs from the protein apolar surface coincide with the positions of the peaks in the RDFs from the charged surface. The RDFs of water surrounding apolar surfaces are modulated by contributions from water molecules in proximity of charged surfaces in proteins, a direct effect of adjacent and intertwining charged and apolar protein surface patches on the protein solvent-accessible surface. The net result is an overlap in the hydration shells of adjacent apolar and charged solvent-accessible surface areas.
Fig. 2 B represents the RDFs for charged surfaces in proteins (red curve) and peptides (orange curve). The blue curve represents the oxygen-oxygen RDF for TIP3 water. The location of the first hydration shells is displaced by .1 Å in proteins with respect to peptides, i.e., the first water oxygen is in average closer to the peptide surface than to the protein surface, a possible consequence of the different curvature at the solute-solvent interface (with a smaller curvature for the protein surface). The second peaks are located at roughly the same distance, 4.7 Å, but the shape and the width of the peaks are different.
Differences in the specific charged, polar, and apolar hydration in proteins and small peptides originate from both chemistry and the local topology and geometry of the solvent-accessible surface. The overall curvature of the solvent-accessible surface for a 150-residue protein is
1:15 Å1 and much larger, 1:3.5 Å1, for dipeptides. The local geometry at the protein-water interface is either convex or concave, corresponding to peaks and valleys on the protein solvent-accessible surface. As a general trend, the charged groups will tend to be on the peaks and the apolar groups on the valleys. Because of their smaller size, this tendency is less strictly followed for small peptides. Global and local differences in the geometry of the solvent-accessible surfaces between proteins and peptides are partially responsible for the fact that data from small compounds cannot be used to estimate the compressibility of water surrounding the protein.
| DISCUSSION |
|---|
|
|
|---|
. From investigations of hydration characteristics of atomic groups at the surface of similar groups in small molecules, we find that the compressibility of protein hydration water can only be assessed from protein-based additive contributions from charged, polar, and apolar surfaces. These contributions are specific to proteins and cannot be modeled based on similar data obtained for small molecules.
Using the detailed information from MD simulations, we have outlined a protocol for separating intrinsic protein and hydration water contributions to experimental compressibility obtained from sound velocity measurements. The formulation in Eq. 14 allows for the intrinsic protein compressibility to be calculated using the chemically specific
and the fractions of charged, polar, and apolar SASA for any protein of known three-dimensional structure.
Our analysis of hydration contribution to protein intrinsic compressibility is similar in spirit to that of Breslauer and co-workers (13
). These authors have determined hydration contributions from cross-correlating thermodynamic and structural data for globular proteins. Our formulation combines information from protein volume fluctuations and the radial distribution functions of water around protein and small molecules to arrive at a simple and easy to implement formula for the intrinsic protein compressibility, while the changes in the compressibility of hydration water are explicitly included.
Interpretation of experimental data on compressibility change due to protein conformational transitions and protein-ligand binding (6
) can benefit from the formulation presented here. It is a matter of estimating the change in each type of accessible surface area. As an example, in a detailed experimental investigation of the change in the partial specific adiabatic compressibility of globular proteins, Chalikian and Breslauer (5
) show that the partial specific adiabatic compressibility of globular proteins decreases upon unfolding. A decrease in the compressibility is also observed in the native to partially unfolded transition, while the transition to a compact intermediate state results in an increase in compressibility. The behavior may be rationalized from our model of hydration compressibility. When the protein unfolds, a large fraction of the previously buried apolar surface is exposed to water. The newly created hydration water layer surrounding the freshly exposed apolar groups has a higher compressibility than bulk water. The effect is a net increase in the compressibility of the hydration water and a decrease in the negative component of hydration contribution to the total compressibility change. Because the intrinsic contribution to the compressibility of the protein unfolded state is practically negligible, the sole contribution to solution compressibility is the hydration water compressibility. The
and the unfolded state
allow for an estimate of the compressibility of the protein in the unfolded state for any protein of known sequence.
| SUPPLEMENTARY MATERIAL |
|---|
|
|
|---|
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Submitted on April 24, 2006; accepted for publication August 31, 2006.
| REFERENCES |
|---|
|
|
|---|
2. Eden, D., J. B. Matthew, J. J. Rosa, and F. M. Richards. 1982. Increase in apparent compressibility of cytochrome c upon oxidation. Proc. Natl. Acad. Sci. USA. 79:815819.
3. Phelps, D. K., and C. B. Post. 1995. A novel basis for capsid stabilization by antiviral compounds. J. Mol. Biol. 254:544551.[CrossRef][Medline]
4. Chalikian, T. V., A. P. Sarvazyan, and K. J. Breslauer. 1994. Hydration and partial compressibility of biological compounds. Biophys. Chem. 51:89109.[CrossRef][Medline]
5. Chalikian, T. V., and K. J. Breslauer. 1996. Compressibility as a means to detect and characterize globular protein states. Proc. Natl. Acad. Sci. USA. 93:10121014.
6. Gekko, K. 2002. Compressibility gives new insight into protein dynamics and enzyme function. Biochim. Biophys. Acta. 1595:382386.[CrossRef][Medline]
7. Chalikian, T. V. 2003. Volumetric properties of proteins. Annu. Rev. Biophys. Biomol. Struct. 32:207235.[CrossRef][Medline]
8. Gekko, K., A. Kimoto, and T. Kamiyama. 2003. Effects of disulfide bonds on compactness of protein molecules revealed by volume, compressibility, and expansibility changes during reduction. Biochemistry. 42:1374613753.[CrossRef][Medline]
9. Taulier, N., I. V. Beletskaya, and T. V. Chalikian. 2005. Compressibility changes accompanying conformational transitions of apomyoglobin. Biopolymers. 79:218229.[CrossRef][Medline]
10. Dubins, D. N., R. Filfil, R. B. Macgregor, and T. V. Chalikian. 2003. Volume and compressibility changes accompanying thermally induced native-to-unfolded and molten globule-to-unfolded transitions of cytochrome c: A high pressure study. Biochemistry. 42:86718678.[CrossRef][Medline]
11. Chalikian, T. V., and R. Filfil. 2003. The thermodynamics of protein-protein recognition as characterized by a combination of volumetric and calorimetric techniques: the binding of turkey ovomucoid third domain to alpha-chymotrypsin. J. Mol. Biol. 326:12711288.[CrossRef][Medline]
12. Jung, C. 2002. Cytochrome p-450-co and substrates: lessons from ligand binding under high pressure. Biochim. Biophys. Acta. 1595:309328.[CrossRef][Medline]
13. Chalikian, T. V., M. Totrov, R. Abagyan, and K. J. Breslauer. 1996. The hydration of globular proteins as derived from volume and compressibility measurements: cross correlating thermodynamic and structural data. J. Mol. Biol. 260:588603.[CrossRef][Medline]
14. Dadarlat, V. M., and C. B. Post. 2003. Adhesive-cohesive model for protein compressibility: an alternative perspective on stability. Proc. Natl. Acad. Sci. USA. 100:1477814783.
15. Steinbach, P., and B. Brooks. 1993. Protein hydration elucidated by molecular dynamics simulation. Proc. Natl. Acad. Sci. USA. 90:91359139.
16. Schroder, C., T. Rudas, S. Boresch, and O. Steinhauser. 2006. Simulation studies of the protein-water interface. I. Properties at the molecular resolution. J. Chem. Phys. 124:234907.[CrossRef][Medline]
17. Rudas, T., C. Schroder, S. Boresch, and O. Steinhauser. 2006. Simulation studies of the protein-water interface. II. Properties at the mesoscopic resolution. J. Chem. Phys. 124:234908.[CrossRef][Medline]
18. Vitkup, D., D. Ringe, G. A. Petsko, and M. Karplus. 2000. Solvent mobility and the protein "glass" transition. Nat. Struct. Biol. 7:3438.[CrossRef][Medline]
19. Rupley, J. A., and G. Careri. 1991. Protein hydration and function. Adv. Protein Chem. 41:37172.[Medline]
20. Pollack, G. H. 2001. Is the cell a geland why does it matter? Jpn. J. Physiol. 51:649660.[CrossRef][Medline]
21. Pollack, G. H. 2003. The role of aqueous interfaces in the cell. Adv. Colloid Interface Sci. 103:173196.[CrossRef][Medline]
22. Brunne, R. M., E. Liepinsh, G. Otting, K. Wuthrich, and W. F. van Gunsteren. 1993. Hydration of proteins: a comparison of experimental residence times of water molecules solvating the bovine pancreatic trypsin inhibitor with theoretical model calculations. J. Mol. Biol. 231:10401048.[CrossRef][Medline]
23. Badger, J. 1993. Multiple hydration layers in cubic insulin crystals. Biophys. J. 65:16561659.
24. Svergun, D. I., S. M. Richard, H. J. Koch, Z. Sayers, S. Kuprin, and G. Zaccai. 1998. Protein hydration in solution: experimental observation by x-ray and neutron scattering. Proc. Natl. Acad. Sci. USA. 95:22672272.
25. Venu, K., L. A. Svensson, and B. Halle. 1999. Orientational order and dynamics of hydration water in a single crystal of bovine pancreatic trypsin inhibitor. Biophys. J. 77:10741085.
26. Russo, D., G. Hura, and T. Head-Gordon. 2005. Hydration dynamics near a model protein surface. Biophys. J. 86:18521862.
27. Levitt, M., and R. Sharon. 1988. Accurate simulation of protein dynamics in solution. Proc. Natl. Acad. Sci. USA. 85:75577561.
28. Makarov, V. A., K. A. Andrews, and B. M. Pettitt. 1998a. Protein hydration density: theory, simulation and crystallography. Curr. Opin. Struct. Biol. 8:218221.[CrossRef][Medline]
29. Makarov, V. A., K. A. Andrews, and B. M. Pettitt. 1998b. Reconstructing the protein-water interface. Biopolymers. 45:469478.[CrossRef][Medline]
30. Makarov, V. A., B. K. Andrews, P. E. Smith, and B. M. Pettitt. 2000. Residence times of water molecules in the hydration sites of myoglobin. Biophys. J. 79:29662974.
31. Henchman, R. H., and J. A. McCammon. 2002. Structural and dynamic properties of water around acetylcholinesterase. Protein Sci. 11:20802090.
32. Halle, B., and M. Davidovic. 2003. Biomolecular hydration: from water dynamics to hydrodynamics. Proc. Natl. Acad. Sci. USA. 100:1213512140.
33. Pratt, L. R., and D. Chandler. 1977. Theory of the hydrophobic effect. J. Chem. Phys. 67:36833704.[CrossRef]
34. Lazaridis, T., and M. E. Paulaitis. 1992. Entropy of hydrophobic hydration: a new statistical mechanical formulation. J. Chem. Phys. 96:3847.[CrossRef]
35. Lazaridis, T., and M. E. Paulaitis. 1994. Simulation studies of the hydration entropy of simple, hydrophobic solutes. J. Phys. Chem. 98:635.