| Pathways and Kinetic Barriers in Mechanical Unfolding and Refolding of RNA and Proteins Structure, Volume 14, Issue 11, 1 November 2006, Pages 1633-1645 Changbong Hyeon, Ruxandra I. Dima and D. Thirumalai Summary Using self-organized polymer models, we predict mechanical unfolding and refolding pathways of ribozymes, and the green fluorescent protein. In agreement with experiments, there are between six and eight unfolding transitions in the ribozyme. Depending on the loading rate, the number of rips in the force-ramp unfolding of the ribozymes is between two and four. Force-quench refolding of the P4-P6 subdomain of the ribozyme occurs through a compact intermediate. Subsequent formation of tertiary contacts between helices P5b-P6a and P5a/P5c-P4 leads to the native state. The force-quench refolding pathways agree with ensemble experiments. In the dominant unfolding route, the N-terminal α helix of GFP unravels first, followed by disruption of the N terminus β strand. There is a third intermediate that involves disruption of three other strands. In accord with experiments, the force-quench refolding pathway of GFP is hierarchic, with the rate-limiting step being the closure of the barrel. Summary | Full Text | PDF (2148 kb) |
| Platform F: DNA, RNA, Structure & Conformation Biophysical Journal, Volume 94, Issue , 1 February 2008, Pages 21-24 Full Text | PDF (85 kb) |
| Forced-Unfolding and Force-Quench Refolding of RNA Hairpins Biophysical Journal, Volume 90, Issue 10, 15 May 2006, Pages 3410-3427 Changbong Hyeon and D. Thirumalai Abstract Nanomanipulation of individual RNA molecules, using laser optical tweezers, has made it possible to infer the major features of their energy landscape. Time-dependent mechanical unfolding trajectories, measured at a constant stretching force () of simple RNA structures (hairpins and three-helix junctions) sandwiched between RNA/DNA hybrid handles show that they unfold in a reversible all-or-none manner. To provide a molecular interpretation of the experiments we use a general coarse-grained off-lattice Gō-like model, in which each nucleotide is represented using three interaction sites. Using the coarse-grained model we have explored forced-unfolding of RNA hairpin as a function of and the loading rate (). The simulations and theoretical analysis have been done both with and without the handles that are explicitly modeled by semiflexible polymer chains. The mechanisms and timescales for denaturation by temperature jump and mechanical unfolding are vastly different. The directed perturbation of the native state by results in a sequential unfolding of the hairpin starting from their ends, whereas thermal denaturation occurs stochastically. From the dependence of the unfolding rates on and we show that the position of the unfolding transition state is not a constant but moves dramatically as either or is changed. The transition-state movements are interpreted by adopting the Hammond postulate for forced-unfolding. Forced-unfolding simulations of RNA, with handles attached to the two ends, show that the value of the unfolding force increases (especially at high pulling speeds) as the length of the handles increases. The pathways for refolding of RNA from stretched initial conformation, upon quenching to the quench force , are highly heterogeneous. The refolding times, upon force-quench, are at least an order-of-magnitude greater than those obtained by temperature-quench. The long -dependent refolding times starting from fully stretched states are analyzed using a model that accounts for the microscopic steps in the rate-limiting step, which involves the to transitions of the dihedral angles in the GAAA tetraloop. The simulations with explicit molecular model for the handles show that the dynamics of force-quench refolding is strongly dependent on the interplay of their contour length and persistence length and the RNA persistence length. Using the generality of our results, we also make a number of precise experimentally testable predictions. Abstract | Full Text | PDF (1091 kb) |
Copyright © 2007 The Biophysical Society. All rights reserved.
Biophysical Journal, Volume 92, Issue 3, 731-743, 1 February 2007
doi:10.1529/biophysj.106.093062
Biophysical Theory and Modeling
Changbong Hyeon*,
,
and D. Thirumalai*, †,
, 
* Biophysics Program, Institute for Physical Science and Technology, University of Maryland, College Park, Maryland
† Department of Chemistry and Biochemistry, University of Maryland, College Park, Maryland
Address reprint requests to D. Thirumalai, Tel.: 301-405-4803.
Address reprint requests to Changbong Hyeon, Tel.: 858-534-7354.
) changes dramatically as the loading rate is varied. At loading rates comparable to those used in laser optical tweezer experiments, the hairpin is plastic, with
being midway between folded and unfolded states; whereas at high loading rates,
moves close to the folded state, i.e., RNA is brittle. For the 29-nucleotide TAR RNA with the three-nucleotide bulge, unfolding occurs in a nearly two-state manner with an occasional pause in a high free energy metastable state. Forced unfolding of the 55 nucleotides of the Hepatitis IRES domain IIa, which has a distorted L-shaped structure, results in well-populated stable intermediates. The most stable force-stabilized intermediate represents straightening of the L-shaped structure. For these structures, the unfolding pathways can be predicted using the contact map of the native structures. Unfolding of a RNA motif with internal multiloop, namely, the 109-nucleotide prohead RNA that is part of the ϕ29 DNA packaging motor, at constant value of rf occurs with three distinct rips that represent unraveling of the paired helices. The rips represent kinetic barriers to unfolding. Our work shows 1), the response of RNA to force is largely determined by the native structure; and 2), only by probing mechanical unfolding over a wide range of forces can the underlying energy landscape be fully explored.The discovery of self-splicing catalytic activity of ciliate Tetrahymena thermophila ribozyme 1,2 and subsequent findings that RNA molecules play an active role as enzymes 3 in many cellular processes have revolutionized RNA research. Just as for protein folding, the structures and functions of RNA enzymes (ribozymes) are linked. As a result, the RNA folding problem—namely, how a nucleotide sequence folds to the native state conformation—is important in molecular biology. Several studies from a number of groups 4,5,6,7,8,9 have shown that, under in vitro conditions, ribozymes have rugged energy landscapes. Despite significant advances in our understanding of how ribozymes fold, several outstanding issues remain, which can be addressed using single molecule experiments 4,5,6,7,8,9,10.
Thermodynamic and kinetic measurements in ensemble and single-molecule florescent energy transfer experiments are typically made by varying the concentration of counterions. Recently, using the laser optical tweezer (LOT) setup, mechanical force has been used to trigger folding and unfolding of RNA molecules at a single-molecule level 11,12. Mechanical force, applied to a specific position of the molecule, induces sequence- and structure-dependent response, which is reflected in the force-extension curve (FEC) that is usually fit using the worm-like model 13,14. The stability of RNAs is inferred by integrating the FECs. For simple motifs, such as hairpins, it has been shown that the stability of the native structures can be accurately measured using mechanical unfolding trajectories that exhibit multiple transitions between the folded and the unfolded state when the force is held constant 11. Similarly, thermodynamics of ribozymes can also be obtained using the nonequilibrium work theorem 15,16.
Mechanical force has also been used to probe unfolding and refolding kinetics of RNA. The cooperative reversible folding of hairpins has been shown by monitoring the end-to-end distance (R), a variable conjugate to the mechanical f, as a function of time. This procedure works best when RNA folding is described using two-state approximation. For multidomain ribozymes, the folding/unfolding kinetics is complex and new tools are required to interpret the kinetic data. In a pioneering study, Onoa et al. 12 showed that the rips in FECs for the L-21 derivative of Tetrahymena thermophila ribozyme (T. ribozyme), composed of multiple domains, are a result of unfolding of individual intact domains that are stabilized in the native state by counterion-dependent tertiary interactions.
The single molecule studies show that the response to mechanical force is a powerful tool to analyze the underlying principles of RNA self-assembly. The extraction of unfolding pathway using FECs alone is not easy, especially when the ribozyme is composed of multiple domains 12. To decipher the unfolding pathways of T. ribozyme, Onoa et al. 12 did a series of experiments in which FECs of different independently folding subdomains were used to interpret the order of unfolding of the substructures. In RNA, there is a clear separation in the free energies associated with secondary and tertiary interactions. Thus, the FEC for a multidomain ribozyme is, to a first approximation, the union of the FECs for the individual domains. Such a strategy can be used to assign a rip of FEC to the unfolding of a particular subdomain as long as the contour lengths of two different unfolded motifs are not similar. Moreover, it is known that the precise response of RNA to force depends not only on the sequence and the native structure but also on how the force is applied 17,18. Single-molecule experiments can be performed in different modes that includes either force-clamp (f is in constant) 11,19 or force-ramp (f varies in a time-dependent manner) 18,20. Theoretical studies have proposed models for obtaining a number of experimentally measurable quantities including FECs for RNA 21,22,23. Computational studies have shown, using RNA hairpin as an example, that the kinetics of unfolding and force-quench refolding as well the nature of unfolding depend on the magnitude of f and the loading rate (rf) 17,24. These studies show that it is important to complement the single-molecule studies with computations that can reliably resolve key issues that are difficult to address in experiments.
In this article, we probe the forced-unfolding dynamics of RNA molecules using a simple model. Because these simulations can be used to directly monitor structures in the transition from folded to the fully stretched states, unfolding pathways can be unambiguously resolved. We introduce the self-organized polymer (SOP) model for RNA that is based only on the self-avoiding nature of the RNA and the native structure. We apply the SOP model to probe forced-unfolding of a number of RNA structures of varying complexity. Many of the subtle features of the variations in the mechanical unfolding as a function of f and rf can be illustrated using P5GA, a simple RNA. For example, we show that the dramatic movements in the location of the unfolding transition state occur as rf (or f) is varied. Applications to structures of increasing complexity (TAR RNA, prohead RNA from domain IIa of the Hepatitis C virus, ϕ29 DNA bacteriophage motor) show that discrete intermediates can be populated in force-ramp and force-clamp simulations over a certain range of forces. Our results show that the response of RNA to force is largely dependent on the architecture of the native state. More importantly, we have established that the characterization of the energy landscape requires using force values (or loading rates) over a wide range.
Our goal is to construct a model for obtaining mechanical folding and unfolding trajectories for simple RNA hairpins to large ribozymes. The model has to be realistic enough to take into account the interactions that stabilize the native fold, yet simple enough that the response to a wide range of forces and loading rates can be explored. To this end, we introduce a new class of versatile coarse-grained self-organized polymer (SOP) model that is particularly well suited for single-molecule force spectroscopy applications of large ribozymes and proteins. The SOP model can be used to probe the response of mechanical force that is applied by means of force-clamp (constant force), force-ramp, and force-quench. The reasons for using SOP model in force spectroscopy applications are the following.
With the above observations in mind, we propose the SOP model for RNA that retains chain connectivity and favorable attractive interactions between sites that stabilize the native fold. Each interaction center represents the center of mass of a nucleotide. In terms of the coordinates {ri, i=1, 2, …N} of RNA with N nucleotide, the total potential energy in the SOP representation is
![]() | (1) |
The first term is for the chain connectivity. The finite extensible nonlinear elastic potential 26 is used with k=20kcal/(mol×Å2), R0=0.2nm, and ri, i+1 is the distance between neighboring beads interaction centers i and i+1,
is the distance in the native structure. The use of finite extensible nonlinear elastic potential is more advantageous than the standard harmonic potential, especially when considering forced-stretching because the fluctuations of ri, i+1 are strictly restricted around
with variation of ±R0. The Lennard-Jones potential is used to account for interactions that stabilize the native topology. Native contact is defined for the pair of interaction centers whose distance is <RC≤1.4nm in the native state for |i−j|>2. If i and j sites are in contact in the native state, Δij=1, otherwise Δij=0. We used ϵh=0.7kcal/mol for the native pairs, ϵl=1kcal/mol for nonnative pairs. In the current version, we have neglected nonnative attractions that will not qualitatively affect the results because, under tension, such interactions are greatly destabilized. To ensure the noncrossing of the chain, we set σ=7Å. Only for i, i+2 pairs we set σ*=3.5Å to prevent the flattening of the helical structures when the overall repulsion is large. There are five parameters in the SOP force field (k, R0, ϵh, ϵl, and Rc) 27. Of these, the results are sensitive to the precise values of ϵh/ϵl and Rc. We have discovered that the quantitative results are insensitive to Rc as long as it is in the physical range that is determined by the RNA contact maps. In principle, the ratio ϵh/ϵl can be adjusted to obtain realistic values of forces. For simplicity, we choose a uniform value of ϵh for all RNA constructs. Surprisingly, the SOP force field, with the same set of parameters, can be used to obtain near-quantitative results for RNA molecules of varying native topology.
The time spent to calculate Lennard-Jones forces scales as
. Drastic savings in computational time can be achieved by truncating the forces due to the Lennard-Jones potential for interaction pairs with rij>(
or 3σ) to zero. We refer to the model as the self-organized polymer (SOP) model because it only uses the polymeric nature of the biomolecules with the crucial topological constraints that arise by the specific fold. For probing forced-unfolding of RNA (or proteins) it is sufficient to include attractive interactions only between contacts that stabilize the native state (see Eq. (1)). We believe none of the results will change qualitatively if this restriction is relaxed, i.e., if nonnative interactions are also taken into account.
Using the SOP model, we simulated the mechanical unfolding and refolding of various RNA structures from a simple hairpin to a large ribozyme (N ≈ 400). To simulate force-ramp experiments, we pull a harmonic spring (ks=28pN/nm), which is attached to the 3′ end of molecule at a constant speed (v). The time(t)-dependent force acting on the 3′ end is
, where z is zth coordinate of the 3′ end. In force-clamp simulations a constant force is applied to one end of the molecule while the other end is fixed. Finally, in force-quench computations the force on the molecule is reduced to the final value to initiate mechanical refolding. In both the force-clamp and force-quench setups the dynamics of the linker (usually hybrid RNA/DNA handles) is not relevant; however, depending on the characteristics of the linkers, the dynamics of linker may play an important role in the force-ramp experiments 24.
Since a typical value for the mass of a nucleotide, m ∼ 300–400g/mol, the average distance between the adjacent nucleotides in the SOP representation of RNA is a ≈ 5Å, ϵh=0.7kcal/mol, and the natural time is
. We use τL=4.0ps to convert simulation times into real times. To estimate the timescale for mechanical unfolding dynamics, we use a Brownian dynamics algorithm 28,29, for which the natural time for the overdamped motion is
. We used
in the overdamped limit, which approximately corresponds to the friction constant of a nucleotide in water.
The equations of motion in the overdamped limit are integrated using the Brownian dynamics algorithm. The position of a bead i at the time t+h is given by
![]() | (2) |
; the Newtonian force acting on a bead is i; and Γ(t) is a random force on ith bead that has a white noise spectrum. The autocorrelation function for Γ(t) in the discretized form is![]() | (3) |
) are
with τL=4ps, ϵh≈0.7kcal/mol, and kBT≈0.6kcal/mol. The system composed of RNA and the spring is extended along the force direction by δx every 104τH=0.47μs integration time steps. We chose δx=0.003nm, so that the pulling speed,
, for h=0.1 τL. To maintain numerical stability, neither h nor δx should be too large.In RNA with simple native structures, force-induced unfolding pathways can be qualitatively predicted from the native structure. To rationalize the simulated unfolding pathways it is useful to construct RNA contact maps. We generated the contact maps (Figure 2 and Figure 3 and Figure 4 and Figure 5) using the definition of native contact. More precisely, we generated a matrix
where matrix elements are
![]() | (4) |
is the distance between two nucleotides in the SOP representation of the native fold. Representing the native RNA using only the center of mass of each nucleotide as the interaction center is a drastic simplification. To ascertain whether the SOP representation misses any essential feature of the RNA structure we also generated the distance map using the heavy atom (C, N, O, P) coordinates. The coarse-grained model captures the important interactions on length scales, i.e., >∼0.7nm. For example, the SOP contact map (Figure 4C) and the distance map (Fig. 1) of 1uud are similar.The dynamics of RNA unfolding is monitored using a number of variables including the time-dependence of R and the number of nucleotide-dependent native contacts Qi(t) that remain at time t. We define
, where RC is the cutoff distance for native contacts; rij(t) is the distance between i and jth nucleotide; and Δij=1 for native contact, otherwise Δij=0. If a certain subdomain of the molecule is disrupted and loses its contacts, the extension of the molecule suddenly increases and the mechanical force exerted on the end of the molecule drops instantly. These molecular events are reflected as rips in the FEC. When the time-dependence of the force f(t) or the end-to-end distance R(t) is directly compared with Qi(t) using t as a progressive variable to describe unfolding, the direct correlation between sudden drops (sudden increase) in the value of f(t), (R(t)), and Qi(t) enables us to unambiguously identify the structures involved in the dynamics of rupture of contacts at the nucleotide level.
The stability of RNA molecule in the native state can be approximated as the sum of interactions
where
and
are interactions that stabilize the secondary and tertiary structures, respectively, i refers to the number of secondary structural elements, and k labels the tertiary contacts that may be mediated by counterions. The contributions from the tertiary interactions are small compared to the energetics associated with the secondary interactions (
). Because of the stability gap between the secondary and the tertiary interactions the analysis of FEC for RNA can be independently made domain by domain. The hairpin stacks, which can vary in the length and sequence, are among the simplest structural motifs. Additional structural complexity in RNA arises due to the presence of hairpin loops, bulges, internal loops, and internal multiloops. The remarkable structural diversity of RNA secondary structures allows us to probe the sequence and fold-dependent energy landscape using force as a perturbation. Here, we discuss the force spectroscopy of relatively simple RNA motifs using four examples. Many aspects of the physics of mechanical unfolding of RNA, such as the shifts in the transition-state locations as rf is changed, can be understood using these simple structural motifs as examples.
Liphardt et al. 11 showed that the P5ab hairpin, the construct in which P5c stem-loop and the A-rich bulge in P5a are removed from the P5abc subdomain in T. ribozyme, reversibly folds in an all-or-none fashion upon application of constant force. The equilibrium between the native basin of attraction (NBA) and the unfolded basin of attraction (UBA) can be shifted by altering the value of the constant force, fc. To probe the two-state behavior of hairpins under force we used a smaller 22-nt hairpin, P5GA (Protein Data Bank (PDB) id: 1eor) 17,30. For the P5GA hairpin, simulations over a wide range of forces can be performed in reasonable times. The topologically simple hairpin has a single tetra-loop and nine consecutive basepairs. In an earlier study 17 we showed, using a minimal three-interaction site (TIS) model in which each nucleotide is represented by three sites, that the dynamical behavior of P5GA under tension is qualitatively similar to P5ab. The much simpler SOP representation of P5GA allows us to probe exhaustively the folding and unfolding kinetics of the hairpin that is manipulated by force-ramp, force-quench, and force-clamp.
The hallmark of P5ab 11 and P5GA 17, when a constant force is applied to either the 3′ or the 5′ ends, is the observation of bistable kinetics. When a constant fc is applied to the 3′ end, P5GA makes transitions (Figure 2B) between the UBA (R≈8nm) to the NBA (R≈2nm). At fc=14.0, 15.4, and 17.5pN, a large number of transitions occur over 45-ms duration, which suggests that the hairpin dynamics is effectively ergodic. As in our previous study 17, the equilibrium constant between the folded and unfolded hairpin calculated using a long mechanical unfolding trajectory coincides with an independent ensemble average calculation, i.e., time averages are roughly equivalent to ensemble averages. When fc=14pN, the residence time in the NBA is much greater than in the UBA while, at fc=16.8pN, the UBA is preferentially populated (Figure 2B). The population of P5GA in the NBA changes when fc is varied, as can be seen in the histogram (P(R)) of the end-to-end distance R (Fig. 2). At fc=15.4pN, which is slightly above the midpoint of the NBA↔UBA transition, several jumps between the NBA and UBA are observed. The P(R) distribution reflects the bistable nature of the landscape. The free energy profile with respect to R is computed using ΔF(R)=−kBT log P(R). From P(R) at fc=15.4 we can obtain the free energy of stability of the folded hairpin with respect to the unfolded state using ΔG≈fcΔRUF where ΔRUF is the distance between the folded and unfolded states of P5GA. Using ΔRUF≈6nm, we find that ΔG≈13kcal/mol. The Vienna RNA package 31, which uses entirely different free energy parameters for RNA, gives ΔG≈12.8kcal/mol. This comparison shows that the SOP model can, for simple structures, give accurate results for stability. At fc=15.4pN, the transition barrier is ∼1.5 kBT. The UBA is more populated at this value of fc. The observed transition times are much shorter than the residence times in each basin of attraction, which is also a reflection of the underlying cooperativity of the all-or-none of nature of hopping between UBA and NBA.
Qualitatively similar results were observed in our previous study using the three-interaction site (TIS) model 17. However, the values of the midpoint of the force was approximately a factor-of-two smaller in the TIS model than in the SOP model. Despite the large differences in the nature of the force fields, the overall results are robust, suggesting that it is the underlying native structure determining the nature of the force-induced transitions in simple RNA. Indeed, for simple structures the mechanism of forced unfolding in RNA helices are imprinted in the contact map (Figure 2A), which is a two-dimensional representation of the folded hairpin (Methods). The clustered band in the contact map suggests that P5GA should unfold when a critical number of stacking interactions are unzipped. Thus, the contact map for P5GA (and presumably P5ab) is consistent with the observed two-state kinetics. As the architecture of the native state becomes more complex, it becomes difficult to anticipate the unfolding mechanism using the contact map alone (see below).
We also performed force-ramp simulations by subjecting the P5GA hairpin to a continuously changing force, i.e., varying the loading rate (Methods). The simplicity of the SOP model allows us to use values of rf that are comparable to those used in laser optical tweezer (LOT) experiments. At
), the force-extension curves show a transition to the UBA at f∼13pN (Figure 3A). As the force dynamically increases, we observe bistable fluctuations in the FEC between the NBA and the UBA just as when force is held constant (Figure 3A). The conformational fluctuations between the two states are unambiguously seen in the time-dependence of the end-to-end distance (R(t)) (Figure 3B). As time progresses, the force is ramped up, resulting in global unfolding (R≈8nm) for t>400ms (Figure 3B). During the timescale of simulation, we find frequent and sharp transitions between the UBA and the NBA (Figure 3B).
,
,
, and 10
(
(ks=0.7pN/nm, v=6.4μm/s)). The red star is the rupture force value (f*=12.5pN) at rf=45pN/s. (E) Plot f*, the most probable unfolding force, as a function of rf. The position of transition state
is computed using
. The inset shows the variation of
as a function of rf.The location of the unfolding transition state
for proteins and RNA is often estimated from force-ramp experiments using the variation of the most probable rupture force with rf ([f*, log rf] plot). The loading rate, rf=df(t)/dt, and can be accurately estimated from the slope of the time-dependence of f(t) as a function of time. The slope of f(t) as a function of t (Figure 3C) is nearly the same as rf≈ks×v, where v is the pulling speed. Strictly speaking, rf=keff×v with
and ks, kmol, and klinker are the spring constants of the optical trap, the RNA molecule, and linker, respectively. Typically ks≪kmol, klinker, thus keff≈ks32. Throughout the article, we obtain the loading rate using rf=ksv.
From the force distributions, computed at four different loading rates (Figure 3D), we observe that the most probable rupture force (f*) does not increase logarithmically over a wide range of loading rates (Figure 3E). Only if the range of rf is restricted, f* changes linearly with log rf33. The location of the transition state (
) is usually calculated using
33, which may be reasonable as long as rf range is small. However, the [f*, log rf] plot is highly nonlinear (Figure 3E). If we use linear regression to analyze the [f*, log rf] plot then
for the distance between the NBA and the transition state. The small value of
is a consequence of the large variation of
as rf is changed 17,34. If the loading rate is varied over a broad range, the rf-dependence of
is manifested as a pronounced convex curvature 24 in the [f*, log rf] plot (Figure 3E). Based on the equilibrium free energy profile F(R) as a function of R17, we expect that, for P5GA,
≈3nm if rf is small. Indeed, from the constant force simulation results in Figure 2B we find
. Thus, the slope of [f*, log rf] should decrease (
increases) as rf decreases. To illustrate the dramatic movement in the transition state, we have calculated
using f* values for two consecutive values of rf. For example, using f*≈12.5pN at rf=450pN/s and f*≈14.9pN at rf=4.5×103pN/s (a value that can be realized in atomic force microscopy experiments), we obtain
≈4.0nm. From the values of f* at five values of rf, we find that
can move dramatically (Figure 3E). In particular, we find that
changes by nearly a factor of 10 as the loading rate is decreased to values that are accessible in LOT experiments (see inset in Figure 3E). Because of the nearly logarithmic variation of
on rf over a narrow range of rf, we do not expect
to change appreciably if rf is lowered from 45pN/s to ≈5pN/s. At high rf or fc, the unfolded state is greatly stabilized compared to the folded state. From Hammond’s postulate 35, generalized to mechanical unfolding 24, it follows that, as fc increases,
should move closer to the native state. The simulations are, therefore, in accord with the Hammond’s postulate.
The presence of bulges or internal loops contributes to the bending of the stiff helical stack. The enhanced flexibility enables formation of intramolecular tertiary contacts or facilitates RNA-protein interactions. Base-sugar and base-phosphate as well as base-base contacts are found in these structural elements in the native structure. For example, the base group of U23 in HIV-1 TAR RNA (PDB code 1uud 36, Figure 4A), protrudes away from the hairpin stack and makes tertiary contacts with i=38–41 (see three-dimensional structure in Figure 4B). The inherent flexibility in the bulge region also facilitates interaction with ligands 37,38.
(ks=0.7pN/nm, v=6.4μm/s) (left) and
(ks=0.7pN/nm, v=0.64μm/s) (right) for a number of molecules. The arrow in the left panel is the signature of a kinetic intermediate, which is absent when rf is reduced.To probe the effect of the U23C24U25 bulge (Figure 4A) on mechanical unfolding, we performed force-ramp and force-clamp simulations. The time-dependence of R at fc=14pN (NBA is the preferred state) shows multiple transitions. Unlike in P5GA some of the transitions to UBA involves a small pause in intermediate values of R(5–8) nm, which is suggestive of a short-lived intermediate. Force-clamp simulations also show that R fluctuates between 2nm (NBA) and 10nm (UBA). In addition, there is a signature for an intermediate with R between (5 and 8) nm. The free energy profile ΔF(R) shows that there is a high free energy intermediate centered at R ≈ 6nm that is metastable with respect to the NBA and the UBA. At fc=14pN, the intermediate is less stable than the UBA and the NBA. It is likely that the instability of the intermediate state might make it difficult for experimental detection. As force is increased to fc (= 14.7pN), which is very close to the critical value at which the stabilities of the folded and stretched states are nearly equal, the residence times in the NBA and UBA are similar (see middle panel in Figure 4D). The P(R) distribution and ΔF(R) as a function of R show that fc=14.7pN is close to the critical value. Interestingly, at fc=14.7pN, the shallow high free energy intermediate is less pronounced than at fc=14.0pN (Figure 4D). When force is further increased to fc=15.4pN, the intermediate becomes essentially a part of the UBA. When the hairpins are unzipped, the presence of bulges and internal loops contributes to the formation of the intermediate state when tertiary contacts between these bulges and the rest of the structure are disrupted. In the predicted intermediate, the first six basepairs (Figure 4A) and additional contacts associated with these nucleotides are ruptured (see the representative structures in Figure 4D).
The contact map for TAR RNA has two “clusters” (Figure 4C): one at the upper-right corner that is associated with the lower hairpin stack, and the other (nucleotides 25–31) that represents the structure from the bulge to the apical end. As the lower stack unfolds, the force propagation goes along the diagonal for the upper cluster to the lower cluster (Figure 4C). We can qualitatively predict the kinetic barriers that oppose forced-unfolding at high rf using the contact map computed from the native structure (Figure 4C). If hairpin unfolding occurs in a sequential unzipping manner, then we expect (upon applying force to the 3′ end) the structure to disrupt along the direction specified by the series of red arrows in Figure 4C. We predict that, upon disruption of the first basepair (G17C45), the other contacts involving G17 and C45 ((17,39), (17,40), (17,41), (17,42), (17,43), (17,44), (17,45), (18,45), (19,45), and (20,45)) should spontaneously break (upper cluster). Similarly, upon rupture of the G18-C44 basepair, the contacts associated with these nucleotides are disrupted (lower cluster). Therefore, the contact map suggests that rupture should occur as force propagates along the diagonal direction (red arrows in Figure 4C). When the contacts associated with the lower clusters unravel, a high energy intermediate is populated (Figure 4C). Only detailed simulations can reveal the stability and lifetime of the intermediate.
The free energy profile as a function of R can also be computed from the contact map using
, where 〈Nc(T=300K)〉 is the average number of contacts at temperature T=300K at zero force, and NG is the corresponding quality in the PDB structure. The first term is the energetic contribution arising from Nc surviving contacts and the second term accounts for the entropy arising from the segment in which Nd contacts are disrupted. The energetic contribution using Nd and the entropy is given by the product of Nd and the entropy associated with the freely jointed chainlike model. We used Kuhn length d∼2.0nm, and the effective nucleotide length of single-strand RNA, lss∼0.59nm. The chain extension is given by R=R0+Ndlss, where R0 is the end-to-end distance in the folded state. In the presence of constant force, the free energy profile tilts to UBA. The free energy profile ΔF(R) at fc≈17pN is suggestive of an intermediate R ≈ 6nm, whereas at higher force, the signature of the intermediate disappears (see also Figure 4D). Since we approximated the conformational entropy using the freely jointed chain model in this exercise, the estimate for the equilibrium critical force does not coincide with the SOP simulations, which shows that the freely jointed chain model does not estimate the entropy of the finite-sized RNA structures. Nevertheless, for simple hairpins, the number of kinetic barriers or kinetic intermediates can be predicted using this simple analysis based only on the knowledge of native contact topology. More recently, Cocco et al. 23 have proposed a similar scheme using the Monte Carlo simulations on the sequence-dependent free energy profile computed with the Turner’s thermodynamic rule 39. A similar analysis was previously used to estimate equilibrium-unbinding force for proteins 40.
The FECs obtained using force-ramp simulations at rf=4.5×103pN/s (Figure 4E) also show that one intermediate is present. At this value of rf, the presence of an intermediate occurs as a rip in the FECs (indicated using an arrow) at f≈23pN. However, when rf is lowered by a factor of 10 (see right panel in Figure 4E), there is no signature of a rip at f≈15–17pN that corresponds to a pause in a high free energy metastable intermediate. As time increases, the global unfolding are preceded by fluctuations between metastable intermediates to the folded state, and TAR RNA unfolds in an all-or-none manner at a force of ∼20pN. The picture that emerges from force-ramp simulations—namely, the presence of an intermediate at high rf and its absence at low rf—is completely consistent with constant force simulations. The rf-dependent rupture of RNA structures is a general property of self-organized molecules, which we explore fully using physical arguments and simulations of ribozymes (see below).
We consider mechanical unfolding of the 55-nt domain IIa of the Hepatitis C Viral (HCV) genome whose NMR structure 41 is known (PDB code: 1p5m). The secondary structure map of the hairpin contains bulges and is capped by the UUCG tetraloop at the apical end (Figure 5A). The domain IIa oligonucleotide adopts a distorted L-shaped structure 41 (Figure 5B) with a relatively flexible hinge bulge (A53–A57) that is stabilized by Mg2+. Just as for the TAR RNA, the number of plausible kinetic intermediates (in the appropriate force regime) in the NBA→UBA transition and the range of force and R values over which they occur can be anticipated using the contact map, which reflects the nature of the native fold. The contact map (Figure 5C) shows that it can be partitioned into three distinct clusters that are spatially adjacent. Upon application of force to the 3′ end, the rupture of contacts associated with nucleotides G1 and C55 occurs and force propagates diagonally (see upper-right corner of Figure 3C). There is a change in the structure of the contact map, with the breaking of contacts involving the basepair (A8–U49) that signal the formation of the first intermediate. As a result, force propagates along the diagonal associated with second cluster. Upon disruption of A8U49, basepair force propagates along the diagonal of the subcontact map (blue line in Figure 5C). The second intermediate is populated when all the base contacts involving this substructure unravel. Similarly, for 1p5m, we expect population of the third intermediate, which opposes forced unfolding associated with substructure-involving contacts associated with the nucleotides near the vicinity of the green line in Figure 5C.
(ks=0.7pN/nm, v=6.4μm/s) for three trajectories. The arrow indicates the kinetic intermediates.Explicit force-clamp simulations confirm that there indeed are three intermediates associated with mechanical unfolding of 1p5m. We generated 10 unfolding trajectories for a cumulative 800ms at fc=21pN. As indicated approximately by the energy profile (Figure 5C), we find multiple transitions between the structures that are revealed as plateaus in R(t) (Figure 5D). Not all the possible transitions are explicitly observed in all the trajectories. This observation may be a reflection of the intrinsic heterogeneity or stochastic nature of fluctuations. By averaging the residence time in each basin of attraction over time and the initial conditions, we obtain P(R) and the associated free energy profile (Figure 5D). The NBA → UBA occurs through a sequence of three intermediates. The barrier separating the UBA and the first intermediate (R≈5nm) is larger than the subsequent ones. All the intermediates, whose populations are fc-dependent (see below), are metastable with respect to NBA and UBA. The predicted intermediates can be detected experimentally provided they are long-lived. If the lifetime of the metastable intermediate is too short, then a high time resolution would be required. An identical unfolding pathway is also found in force-ramp simulations (Figure 5E) in which we find, in many trajectories, three rips that represent the kinetic intermediates. The forces at which these intermediates are populated are higher because the loading rate is large. The structures that are populated along the unfolding pathway (Figure 5D) are in accord with the predictions based on the secondary structure and contact maps.
Just as found explicitly for TAR RNA, the intermediates may not be populated at lower fc or rf values. From the energy profile in Figure 5C, we find (approximately) that the energy associated with a putative intermediate at R≈10nm (see Figure 5D) is ΔF(R≈10nm) ≈50 kBT≈200pN×nm. Thus, the smallest value of f required to transiently populate the R≈10nm intermediate is fi≈ΔF(R≈10nm)/10nm≈20pN. Only if f exceeds 20pN, can these intermediates be significantly populated. At these forces, the predicted intermediates are metastable with respect to both UBA and NBA. Their experimental detection would depend on their lifetimes. Unless relatively high time resolution is used in experiments, for practical purposes, mechanical unfolding of domain IIa of the HCV IR2S RNA might follow two-state behavior. This simple consideration and the simulations might explain the apparent absence of intermediates in the recent constant-force LOT experiments on TAR RNA performed at f<15pN.
The next level of complexity in the secondary structural RNA motif is the one that contains an internal multiloop. We choose the prohead RNA (PDB code 1foq 42) which is part of the ϕ29 DNA packaging motor. In the context of the motor, pRNA assembles as a pentamer with each monomer consisting of two stem-loops that are separated by an internal multiloop (Figure 6A). Even at this level of complexity, it becomes difficult to predict the kinetic barriers associated with force-induced unfolding using the contact map alone. For the 109-nucleotide RNA, we generated four mechanical force-ramp unfolding trajectories (Figure 6B). The mechanical unfolding of the structure exhibits multiple rips signaling the presence of kinetic barriers separating the NBA and UBA (Figure 6B). The four different FECs, colored in black, red, green, and blue, are distinct. Variations in the rip dynamics from molecule to molecule may reflect the heterogeneous nature of the unfolding pathways. Upon force-ramp, unfolding of the pRNA begins at R∼2nm in all the trajectories (Figure 6B). The FEC in red has three rips at R≈15, 28, and 38nm, while the FEC in green has only one rip at R≈38nm. For pRNA, which is structurally more complex than the simpler three-way junction (e.g., P5abcΔA; see 11), the details of the unraveling mechanism are difficult to extract using the FECs alone. To unambiguously extract the unfolding pathways we have calculated the time-dependence of nucleotide-dependent rupture (Figure 6C) of individual contacts (Methods).
(ks=0.7pN/nm, v=6.4μm/s). (C) The rupture history of pRNA (Qi(t)) corresponding to FECs in panel B. Here Qi(t) is the number of contacts that the nucleotide i has at time t. The scale on the right represents the number of contacts associated with the ith nucleotide. Darker shades have a larger number of contacts than the lighter shades. (D) Structures of pRNA at each stage of unfolding, as time progresses, is illustrated. Structures in i–iii are associated with kinetic barriers in the red trajectory in panel B.Because of the additional stem-loop that branches out of the internal multiloop (Figure 6A), there are potentially two routes by which the unfolding of pRNA can proceed. One is P1→P3→P2, and the other is P1→P2→P3. We can rule out the unfolding pathways such as P2→P1→P3 or P3→P2→P1, because of the directional nature of the applied mechanical force. It should be stressed that such pathways can emerge if force is applied at points other than the ends of the chain. Using the time evolution of the native contact of the ith nucleotide, Qi(t), we unambiguously pin-down the time series of the rupture of individual contacts upon force-ramp (Figure 6C). The rupture pattern of the individual contacts in all the trajectories is very similar (Figure 6C), despite seeming differences in the FECs. From Figure 6C, it is clear that the nucleotides in P1-unravel (Qi(t)) associated with these nucleotides are zero for t>5ms) early (t∼5ms), which signals the first event in pRNA unfolding. Subsequently, Qi(t) for i ∼ (70–85) are disrupted. This shows that the smaller stem-loop in P3 opens, forming a cusp at R∼15nm in the FEC. The bigger stem-loop with the large bulge in P2 follows the opening of P3, which is manifested as a rip near R∼35nm in the FEC. Thus, the unfolding pathway for pRNA is P1 → P3 → P2. Depending on the trajectory, not all the three rips are detected in the FECs, although the unfolding pathway defined by the subdomains is not altered. The sequence of structural changes that accompany the unfolding of pRNA for the trajectory in red in Figure 6B is shown in Figure 6D. Force-ramp simulations of pRNA show that, even when the native structure is not too complex, it is difficult to predict the nature of kinetic barriers without detailed dynamical simulations or experiments. It would be interesting to test these predictions using LOT experiments.
We have used the self-organized polymer representation of a variety of RNA structures to predict their mechanical unfolding trajectories. Constant force and force-ramp simulations show that dramatic changes in the force profiles take place as the loading rates and the values of the force are varied. If the force is varied over a wide range, then regions of the energy landscape that cannot be accessed in conventional experiments can be probed. However, to realize the full utility of the single molecule force spectroscopy, it becomes necessary to use force in distinct modes (constant force, force-ramp, and other combinations) along with reliable computations that can mimic the experimental conditions as closely as possible. The simulations of RNA, with diverse native structures, using the SOP model illustrate the structural details in the unfolding pathways that are experimentally accessible. It is remarkable that the simple, native-state-based SOP model can quantitatively predict the FECs for a number of RNA molecules with varying degree of structural complexity. We conclude the article with the following additional remarks.
The small size and simple architecture of RNA hairpins has allowed us to explore their response to force over a range of rf that spans four orders of magnitude. The lowest rf value is close to those used in LOT experiments. A key prediction of our simulations is that the location of the transition state for P5GA moves dramatically from ∼6nm at low rf to ∼0.5nm at high rf (see inset to Figure 3E). The large value of
at low rf suggests that P5GA is plastic while the small
at high rf is suggestive of brittle behavior. The mechanical properties of RNA structures can be drastically altered by varying the loading rate, which is reminiscent of the changes in the visco-elastic behavior of polymeric materials that changes with frequency. The transformation from plastic to brittle behavior can be captured by the fragility index 34, used to describe mechanical unfolding of hairpins. Although we have discussed the rf-dependent movement of the transition states using P5GA as an example, we predict that this result is general and should be observed in other RNA structures as well.
The predicted movement in the transition state as a function of rf might help resolve the apparent differences in the estimated values of
in proteins using laser optical tweezer (LOT) and atomic force microscopy (AFM) experiments. The typical value of
for a number of proteins using AFM is approximately 0.5nm 43, whereas
for RNase H 44 (the only protein that has been experimentally studied using the LOT setup) is ∼6nm. We believe that such a large difference is not merely due to changes in native topology and stability, which undoubtedly are important. Rather, it is due to the variations in rf. The value of rf in LOT is typically <10pN/s, whereas rf in AFM varies from (100–1000) pN/s. Our findings here suggest that the different loading rates used in the two setups might explain the large differences in the values of
. Thus, it is important to perform experiments on a given protein using both LOT and AFM setups to sort out the loading-rate-dependence of the location of the transition state.
From the contact map it is possible to visualize the directions along which the applied tension propagates. As illustrated using simple RNA structures, the unfolding pathway depends on the direction of tension propagation and local structural stability. Using the contact map alone one can anticipate the most probable unfolding pathways. Here, we have shown that for P5GA, TAR RNA, and HCV IRES domain II it is possible to get a qualitative picture of forced-unfolding using the reduced representation of RNA structures in the form of contact maps. However, as the structural complexity increases and a number of alternate unfolding pathways become possible, the simple native-structure-based method alone is not always sufficient in predicting how a particular structure unravels. Such is the case in the unfolding of the three-way junction prohead RNA studied here at constant rf.
The SOP model is remarkably successful in reproducing the unfolding pathways of complex ribozymes in a realistic fashion. Surprisingly, for both proteins 27 and RNA, the SOP model is quite successful in predicting the nature of unfolding pathways. The unfolding dynamics of proteins, as well as RNA, with size exceeding 250 residues or nucleotides, can be conveniently simulated on a PC in a few days with the simulation condition used in this article (rf≈102−105pN/s). Quenching the force to zero drives the stretched state of molecule close to the native state 27, which allows us to map the folding pathways starting from different initial conditions. The SOP model consisting of the polymeric nature and the minimal characteristics of RNA architecture is reasonable in visualizing the forced-unfolding and force-quench refolding dynamics of RNA molecules. We believe that, when accompanied with the experimental analysis, the SOP model can serve as a useful tool that provides insights into the folding/unfolding process of large macromolecules. However, as with all models, there are certain limitations of the SOP model that prevent us from making quantitative predictions of the measurable force-extension curves, especially for large RNA. The sequence and/or the counterion effect are not explicitly taken into account in the SOP model. The neglect of explicit counterions in the simulations fails to capture their specific coordination with RNA, which in turn leads to an underestimate of the local stability of the folded structure. Hence, in applying the SOP model to investigate structures in which counterion-mediated tertiary interactions determine local structures, ϵh should be varied. Despite the obvious limitations, it is clear that the SOP model is powerful enough to provide insights into the structures and pathways that are explored upon application of force. The model cannot only be used as a predictive tool (as shown here with explicit applications on systems for which experiments are not currently available) but also can be used to interpret experimental results.
We are grateful to Prof. Ruxandra I. Dima and Dr. David Pincus for a number of insightful comments.
This work was supported in part by a grant from the National Science Foundation through National Science Foundation grant No. CHE-05-14056.
1. (1981). In vitro splicing of the ribosomal-RNA precursor of Tetrahymena-involvement of a quanosine nucleotide in the excision of the intervening sequence. Cell 27, 487–496. Abstract | | CrossRef | PubMed
2. (1984). Catalytic activity of an RNA molecule prepared by transcription in vitro. Science 223, 285–286. PubMed
3. (2002). The chemical repertoire of natural ribozymes. Nature 418, 222–228. CrossRef | PubMed
4. (2001). Beyond kinetic traps in RNA folding. Curr. Opin. Struct. Biol. 11, 309–314. CrossRef | PubMed
5. (2003). RNA folding: models and perspectives. Curr. Opin. Struct. Biol. 13, 309–316. CrossRef | PubMed
6. (1997). Folding of RNA involves parallel pathways. J. Mol. Biol. 273, 7–13. CrossRef | PubMed
7. (2005). RNA and protein folding: common themes and variations. Biochemistry 44, 4957–4970. PubMed
8. (2005). Single-molecule RNA folding. Acc. Chem. Res. 38, 566–573. CrossRef | PubMed
9. (1999). New pathways in folding of the Tetrahymena group I RNA enzyme. J. Mol. Biol. 291, 1155–1167. CrossRef | PubMed
10. (2000). A single-molecule study of RNA catalysis and folding. Science 288, 2048–2051. CrossRef | PubMed
11. (2001). Reversible unfolding of single RNA molecules by mechanical force. Science 292, 733–737. CrossRef | PubMed
12. (2003). Identifying kinetic barriers to mechanical unfolding of the T. thermophila ribozyme. Science 299, 1892–1895. CrossRef | PubMed
13. (1994). Entropic elasticity of λ-phase DNA. Science 265, 1599–1600. PubMed
14. (1996). Bending and twisting elasticity of DNA. Macromolecules 27, 981–988. CrossRef | PubMed
15. (1997). Nonequilibrium equality for free energy differences. Phys. Rev. Lett. 78, 2690. CrossRef | PubMed
16. (2002). Equilibrium information from nonequilibrium measurements in an experimental test of Jarzynski’s equality. Science 296, 1832–1835. CrossRef | PubMed
17. (2005). Mechanical unfolding of RNA hairpins. Proc. Natl. Acad. Sci. USA 102, 6789–6794. CrossRef | PubMed
18. (2006). Probing the mechanical folding kinetics of TAR RNA by hopping, force-jump, and force-ramp methods. Biophys. J. 90, 250–260. Abstract | Full Text | PDF (335 kb) | CrossRef | PubMed
19. (2004). Force-clamp spectroscopy monitors the folding trajectory of a single protein. Science 303, 1674–1678. CrossRef | PubMed
20. (1999). Mechanical and chemical unfolding of a single protein: a comparison. Proc. Natl. Acad. Sci. USA 96, 3694–3699. CrossRef | PubMed
21. (2003). Mechanically probing the folding pathway of single RNA molecules. Biophys. J. 84, 2831–2840. Abstract | Full Text | PDF (313 kb) | PubMed
22. (2001). Force-induced denaturation of RNA. Biophys. J. 81, 1324–1332. Abstract | Full Text | PDF (229 kb) | PubMed
23. (2003). Slow nucleic acid unzipping kinetics from sequence-defined barriers. Eur. Phys. J. E 10, 153–161. CrossRef | PubMed
24. (2006). Forced-unfolding and force-quench refolding of RNA hairpins. Biophys. J. 90, 3410–3427. Abstract | Full Text | PDF (1091 kb) | CrossRef | PubMed
25. (2000). Native topology determines force-induced unfolding pathways in globular proteins. Proc. Natl. Acad. Sci. USA 97, 7254–7259. CrossRef | PubMed
26. (1990). Dynamics of entangled linear polymer melts: a molecular-dynamics simulation. J. Chem. Phys. 92, 5057–5086. CrossRef | PubMed
27. (2006). Pathways and kinetic barriers in mechanical unfolding and refolding of RNA and proteins. Structure 14,