| Quantifying the Kinetic Paths of Flexible Biomolecular Recognition Biophysical Journal, Volume 91, Issue 3, 1 August 2006, Pages 866-872 Jin Wang, Kun Zhang, Hongyang Lu and Erkang Wang Abstract Biomolecular recognition often involves large conformational changes, sometimes even local unfolding. The identification of kinetic pathways has become a central issue in understanding the nature of binding. A new approach is proposed here to study the dynamics of this binding-folding process through the establishment of a path-integral framework on the underlying energy landscape. The dominant kinetic paths of binding and folding can be determined and quantified. The significant coupling between the binding and folding of biomolecules often exists in many important cellular processes. In this case, the corresponding kinetic paths of binding are shown to be intimately correlated with those of folding and the dynamics becomes quite cooperative. This implies that binding and folding happen concurrently. When the coupling between binding and folding is weak (strong), the kinetic process usually starts with significant folding (binding) first, with the binding (folding) later proceeding to the end. The kinetic rate can be obtained through the contributions from the dominant paths. The rate is shown to have a bell-shaped dependence on temperature in the concentration-saturated regime consistent with experiment. The changes of the kinetics that occur upon changing the parameters of the underlying binding-folding energy landscape are studied. Abstract | Full Text | PDF (300 kb) |
| Studies of Proton Translocations in Biological Systems: Simulating Proton Transport in Carbonic Anhydrase by EVB-Based Models Biophysical Journal, Volume 87, Issue 4, 1 October 2004, Pages 2221-2239 Sonja Braun-Sand, Marek Strajbl and Arieh Warshel Abstract Proton transport (PTR) processes play a major role in bioenergetics and thus it is important to gain a molecular understanding of these processes. At present the detailed description of PTR in proteins is somewhat unclear and it is important to examine different models by using well-defined experimental systems. One of the best benchmarks is provided by carbonic anhydrase III (CA III), because this is one of the few systems where we have a clear molecular knowledge of the rate constant of the PTR process and its variation upon mutations. Furthermore, this system transfers a proton between several water molecules, thus making it highly relevant to a careful examination of the “proton wire” concept. Obtaining a correlation between the structure of this protein and the rate of the PTR process should help to discriminate between alternative models and to give useful clues about PTR processes in other systems. Obviously, obtaining such a correlation requires a correct representation of the “chemistry” of PTR between different donors and acceptors, as well as the ability to evaluate the free energy barriers of charge transfer in proteins, and to simulate long-time kinetic processes. The microscopic empirical valence bond (Warshel, A., and R. M. Weiss. 1980. . 102:6218–6226; and Åqvist, J., and A. Warshel. 1993. . 93:2523–2544) provides a powerful way for representing the chemistry and evaluating the free energy barriers, but it cannot be used with the currently available computer times in direct simulation of PTR with significant activation barriers. Alternatively, one can reduce the empirical valence bond (EVB) to the modified Marcus’ relationship and use semimacroscopic electrostatic calculations plus a master equation to determine the PTR kinetics (Sham, Y., I. Muegge, and A. Warshel. 1999. . 36:484–500). However, such an approximation does not provide a rigorous multisite kinetic treatment. Here we combine the useful ingredients of both approaches and develop a simplified EVB effective potential that treats explicitly the chain of donors and acceptors while considering implicitly the rest of the protein/solvent system. This approach can be used in Langevin dynamics simulations of long-time PTR processes. The validity of our new simplified approach is demonstrated first by comparing its Langevin dynamics results for a PTR along a chain of water molecules in water to the corresponding molecular dynamics simulations of the fully microscopic EVB model. This study examines dynamics of both models in cases of low activation barriers and the dependence of the rate on the energetics for cases with moderate barriers. The study of the dependence on the activation barrier is next extended to the range of higher barriers, demonstrating a clear correlation between the barrier height and the rate constant. The simplified EVB model is then examined in studies of the PTR in carbonic anhydrase III, where it reproduces the relevant experimental results without the use of any parameter that is specifically adjusted to fit the energetics or dynamics of the reaction in the protein. We also validate the conclusions obtained previously from the EVB-based modified Marcus’ relationship. It is concluded that this approach and the EVB-based model provide a reliable, effective, and general tool for studies of PTR in proteins. Finally in view of the behavior of the simulated result, in both water and the CA III, we conclude that the rate of PTR in proteins is determined by the electrostatic energy of the transferred proton as long as this energy is higher than a few kcal/mol. Abstract | Full Text | PDF (917 kb) |
| A Structural Model of Polyglutamine Determined from a Host-Guest Method Combining Experiments and Landscape Theory Biophysical Journal, Volume 87, Issue 3, 1 September 2004, Pages 1900-1918 John M. Finke, Margaret S. Cheung and José N. Onuchic Abstract Modeling the structure of natively disordered peptides has proved difficult due to the lack of structural information on these peptides. In this work, we use a novel application of the host-guest method, combining folding theory with experiments, to model the structure of natively disordered polyglutamine peptides. Initially, a minimalist molecular model (CC) of CI2 is developed with a structurally based potential and captures many of the folding properties of CI2 determined from experiments. Next, polyglutamine “guest” inserts of increasing length are introduced into the CI2 “host” model and the polyglutamine is modeled to match the resultant change in CI2 thermodynamic stability between simulations and experiments. The polyglutamine model that best mimics the experimental changes in CI2 thermodynamic stability has 1), a -strand dihedral preference and 2), an attractive energy between polyglutamine atoms 0.75-times the attractive energy between the CI2 host Go-contacts. When free-energy differences in the CI2 host-guest system are correctly modeled at varying lengths of polyglutamine guest inserts, the kinetic folding rates and structural perturbation of these CI2 insert mutants are also correctly captured in simulations without any additional parameter adjustment. In agreement with experiments, the residues showing structural perturbation are located in the immediate vicinity of the loop insert. The simulated polyglutamine loop insert predominantly adopts extended random coil conformations, a structural model consistent with low resolution experimental methods. The agreement between simulation and experimental CI2 folding rates, CI2 structural perturbation, and polyglutamine insert structure show that this host-guest method can select a physically realistic model for inserted polyglutamine. If other amyloid peptides can be inserted into stable protein hosts and the stabilities of these host-guest mutants determined, this novel host-guest method may prove useful to determine structural preferences of these intractable but biologically relevant protein fragments. Abstract | Full Text | PDF (714 kb) |
Copyright © 2007 The Biophysical Society. All rights reserved.
Biophysical Journal, Volume 92, Issue 12, L109-L111, 15 June 2007
doi:10.1529/biophysj.107.105551
Biophysical Letters
Jin Wang*, †,
,
, Li Xu* and Erkwang Wang*,
, 
* State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin, People's Republic of China
† Department of Chemistry and Department of Physics, State University of New York at Stony Brook, Stony Brook, New York
Address reprint requests and inquiries to Jin Wang; or Erkang Wang.The study of associations between two biomolecules (e.g., proteins, RNA, or DNA) is the key to understanding molecular recognition and function. A standard paradigm, which has been successfully applied for many enzyme proteins, is that molecular function (e.g., binding) is determined by molecular structure. The lock-and-key mechanism of binding assumes that biomolecules maintain rigid structures during association 1. The induced fit mechanism 2 suggests that biomolecules can adjust their conformations to a limited extent during the association. In nature, however, molecular binding often involves large conformational changes in various stages of cell function. It has been estimated that up to 30% of proteins, when isolated, are in their unfolded or partially disordered form 3,4,5,6. Because the final native binding state is usually well structured, this implies that binding toward the native state occurs concomitantly with large conformational changes (e.g., folding). The flexible or disordered form of the proteins in the cells can be targeted for rapid turnover, thus providing an additional lever of control. Here, flexibility rather than rigidity is crucial for binding as well as for biological function. However, flexible binding processes are not yet very well understood. An understanding of how flexibility might help molecular recognition and function is one of the most challenging tasks facing molecular biologists. Addressing this issue can lead to a new paradigm in molecular biology—one that will answer critical questions of how molecular function is determined by flexibility and dynamics, in addition to structure.
Some recent experiments have begun to investigate the mechanisms of flexible binding 7. However, there are so far limited theoretical investigations on this 4,5,6,8. Recently we carried out theoretical studies on biomolecular binding at the interface 9, which provided a basis for studying the more general case of flexible binding.
In this study, we will first construct a thermodynamic energy landscape for molecular recognition and address the roles of flexibility in determining the binding affinity and functional specificity. Affinity and specificity are the two key factors in molecular recognition. Affinity measures the stability resulting from the association of two molecules; specificity is the ability of one molecule to bind with another molecule while discriminating against others. For rigid binding, affinity, and specificity are often correlated. Yet, in flexible binding, flexibility can enable molecules to adjust their conformations to reach the best fit (e.g., high specificity). Quantifying the specificity as well as affinity in flexible binding is crucial in uncovering the mechanism of flexible binding.
Flexible binding involves both binding and conformational degrees of freedom. Thus we need at least three reaction coordinates to describe it: Qb, fraction of native spatial contacts for interface binding; Qf1 and Qf2, fraction of native spatial contacts for flexible conformational change or folding (see Fig. 1. Based on this, we can construct an energy function and derive a free energy landscape F(Qf1, Qf2, Qb).
From the thermodynamic analysis, we expect that the requirement of stable binding against trapping would lead to a funneled binding landscape to guarantee both affinity and specificity 4,5,6,8,9. Only binding with landscape funneled against traps can survive natural evolution, be relatively stable, and perform specific biological functions. With this approach, the role of the interplay between binding and flexibility can be uncovered. Biomolecules need some affinity to be stable but they also need flexibility to adjust to achieve optimal fit and perform specific biological functions. The reality is a balance between the two. We will find an optimal criterion of binding specificity. It can be used for guiding further atomic detailed studies and flexible drug design.
It is known that the fundamental interactions of molecular binding are dominated by the hydrophobic interactions energetically, the electrostatic interactions often acting as the directional force guiding for the molecular binding 10. Binding is guided by the long-range electrostatic interaction. Binding is also controlled by hydrophobic interactions for the thermodynamic stability. The real binding process is the combination of the two. We will focus our attention on the latter process since the first process is studied by many researchers before 10.
We derived the thermodynamic free energy expression for the flexible binding (details in Supplementary Material ):
![]() |
where N1 and N2 are the numbers of the amino acid residues for protein 1 and 2.
is the energy gap or bias per contact toward the native folded state of protein 1,
is the energy gap or bias per contact toward the native folded state of protein 2,
is the energy gap or bias per contact toward the native binding state. T is temperature.
stot (Qf1, Qf2, Qb)=Stot(Qf1, Qf2, Qb)/(N1+N2) is the configurational entropy per contact. The Δɛf1, Δɛf2, Δɛb are the variances or the roughness of the energy landscape per contact for the folding of protein 1, folding of protein 2, and the binding of protein 1 and 2, respectively.
There exists a characteristic temperature where the thermodynamic entropy of the system vanishes and below which the system is completely trapped:
![]() |
This temperature signals the trapping into a low energy conformational state at Qf1, Qf2, Qb We can clearly see
![]() |
when Qf1=Qf2=Qb=0 (nonnative unfolding-unbinding states).
Fig. 2 shows the phase diagram in terms of the combined energy gap δɛ=δE/(N1+N2) and roughness δɛ=δE/(N1+N2) relative to temperature for binding folding energy landscape. There are several phases, the native phase (both binding and folding), partially native phase (native binding but with unfolding phases, native binding but one folded and one unfolded phases, both native folded but unbinding phase, one native folded and one unfolded but unbinding phase), and completely unbinding and unfolded phase. In addition there might be a possible trapping phase for the whole complex.
The native transition temperature Tnative can be determined by setting the free energy equal between native and nonnative phase
![]() |
![]() |
![]() |
To guarantee the thermodynamic stability and discriminate from the local traps, the binding transition temperature Tnative needs to be higher than the local trapping temperature Tg, similar to protein folding 11. We can see the ratio between native phase transition temperature and glassy trapping temperature
monotonically depends on the ratio of gap to roughness ratio Λmodulated by entropy.
Λ can be translated to the structure parameter (representing the degree of biases toward native state relative to the roughness) of the underlying landscape. Thus the binding against trapping becomes the controlling factor determining the thermodynamics. In general we expect that the higher the binding transition temperature (or bias toward the native state) against the trapping temperature (or the roughness of the binding landscape), the more stable the system is. This can naturally lead to an optimization criterion for specificity of the binding-folding process. The real optimization scheme involves total biasing, which is the combination of folding gaps of biomolecules 1 and 2, and binding gap versus variances or traps of the underlying landscape, which is the combination of variances of the two nonnative biomolecules and the one for the binding interface. The Λ then should be significantly >1. This implies that the underlying combined energy landscape of folding and binding should be funneled toward the native state (Fig. 1).
Let us discuss the implications of the above specificity criteria. Protein folding stability is often determined by the hydrophobic core. Binding and function are often determined by the hydrophobic residues on the interface. In nature, there should be a funnel for stability of protein folding. But the landscape might not be maximally funneled, or having the maximal stability. This is clearly shown from the mutational experiments on folding 12. The native proteins are neither thermodynamically the most stable nor the kinetically fastest folders. This means that not all the hydrophobic residues are distributed inside the core of the proteins. There are certain distributions of the hydrophobic residues on the surface for functional purposes (binding). There are disadvantages from overstable and superfast protein folders, because they have less biological functions due to the decrease of surface hydrophobic residues. Furthermore, there is little flexibility to adapt for evolution. So the combined landscape is a delicate balance among folding and binding. The biomolecules should have enough thermodynamic stability and also maintain certain flexibilities for functions.
It is important to unravel the relationship between affinity and specificity. When biomolecules themselves are flexible, part of the affinity is used to adjust the conformations to best fit the binding partners. Therefore, flexibility through conformational change usually gives a good opportunity for realizing the specificity for molecular recognition, but often with the price of sacrificing certain amounts of affinity to adjust the conformations. The resulting lower affinity can give molecules the ability to both bind specifically and unbind easily, which is essential for cell signaling relay and gene regulation.
We believe specificity can be used as an important indicator in addition to affinity for drug screening and design (J. Wang, Y. Yang, D. Druekhammer, W. Yang, and G. M. Verkhivker, unpublished data).
The formalism here for the free energy of flexible binding can be extended to multiple binding complexes or to multidomain protein folding. This approach can also be extended to include multibody interactions among residues.
L.X. and E.K.W. are supported by the National Science Foundation of China. J.W. is supported by the National Science Foundation and Petroleum Research Fund.
An online supplement to this article can be found by visiting BJ Online at http://www.biophysj.org.
1. (1890). Ueber die Optischen Isomeren des Traubenzuckers, der Gluconsaure und der Zuckersaure. Ber. Dtsch. Chem. Ges. 23, 2611–2620. PubMed
2. (1958). Application of a theory of enzyme specificity to protein synthesis. Proc. Natl. Acad. Sci. USA 44, 98–104. CrossRef | PubMed
3. (2002). Intrinsic disorder and protein function. Biochemistry 41, 6573–6582. PubMed
4. (2000). Speeding molecular recognition by using the folding funnel: the fly-casting mechanism. Proc. Natl. Acad. Sci. 97, 8868–8873. CrossRef | PubMed
5. (2003). The physics and bioinformatics of binding and folding: an energy landscape perspective. Biopolymers 68, 333–349. CrossRef | PubMed
6. (2004). Protein topology determines binding mechanism. Proc. Natl. Acad. Sci. USA 101, 511–516. CrossRef | PubMed
7. (2002). Coupling of folding and binding for unstructured proteins. Curr. Opin. Struct. Biol. 12, 54–60. CrossRef | PubMed
8. (2006). Single-molecule dynamics reveals the cooperative binding-folding in biomolecular recognition. PLoS. Comput. Biol. 2, 0842–0852. PubMed
9. (2003). Energy landscape theory, funnels, specificity, and optimal criterion of biomolecular binding. Phys. Rev. Lett. 90, 188101–188104. CrossRef | PubMed
10. (1998). Theory of biomolecular recognition. Curr. Opin. Struct. Biol. 8, 245–249. CrossRef | PubMed
11. (1992). Optimal protein-folding codes from spin-glass theory. Proc. Natl. Acad. Sci. USA 89, 4918–4922. CrossRef | PubMed
12. (2003). Folding at the speed limit. Nature 423, 193–197. CrossRef | PubMed