| Minor Groove Deformability of DNA: A Molecular Dynamics Free Energy Simulation Study Biophysical Journal, Volume 91, Issue 3, 1 August 2006, Pages 882-891 Martin Zacharias Abstract The conformational deformability of nucleic acids can influence their function and recognition by proteins. A class of DNA binding proteins including the TATA box binding protein binds to the DNA minor groove, resulting in an opening of the minor groove and DNA bending toward the major groove. Explicit solvent molecular dynamics simulations in combination with the umbrella sampling approach have been performed to investigate the molecular mechanism of DNA minor groove deformations and the indirect energetic contribution to protein binding. As a reaction coordinate, the distance between backbone segments on opposite strands was used. The resulting deformed structures showed close agreement with experimental DNA structures in complex with minor groove-binding proteins. The calculated free energy of minor groove deformation was ∼4–6kcalmol in the case of a central TATATA sequence. A smaller equilibrium minor groove width and more restricted minor groove mobility was found for the central AAATTT and also a significantly (∼2 times) larger free energy change for opening the minor groove. The helical parameter analysis of trajectories indicates that an easier partial unstacking of a central TA versus AT basepair step is a likely reason for the larger groove flexibility of the central TATATA case. Abstract | Full Text | PDF (790 kb) |
| DNA Packaging in Bacteriophage: Is Twist Important? Biophysical Journal, Volume 88, Issue 6, 1 June 2005, Pages 3912-3923 Andrew James Spakowitz and Zhen-Gang Wang Abstract We study the packaging of DNA into a bacteriophage capsid using computer simulation, specifically focusing on the potential impact of twist on the final packaged conformation. We perform two dynamic simulations of packaging a polymer chain into a spherical confinement: one where the chain end is rotated as it is fed, and one where the chain is fed without end rotation. The final packaged conformation exhibits distinct differences in these two cases: the packaged conformation from feeding with rotation exhibits a spool-like character that is consistent with experimental and previous theoretical work, whereas feeding without rotation results in a folded conformation inconsistent with a spool conformation. The chain segment density shows a layered structure, which is more pronounced for packaging with rotation. However, in both cases, the conformation is marked by frequent jumps of the polymer chain from layer to layer, potentially influencing the ability to disentangle during subsequent ejection. Ejection simulations with and without Brownian forces show that Brownian forces are necessary to achieve complete ejection of the polymer chain in the absence of external forces. Abstract | Full Text | PDF (298 kb) |
| B-DNA Under Stress: Over- and Untwisting of DNA during Molecular Dynamics Simulations Biophysical Journal, Volume 91, Issue 8, 15 October 2006, Pages 2956-2965 Srinivasaraghavan Kannan, Kai Kohlhoff and Martin Zacharias Abstract The twist flexibility of DNA is central to its many biological functions. Explicit solvent molecular dynamics simulations in combination with an umbrella sampling restraining potential have been employed to study induced twist deformations in DNA. Simulations allowed us to extract free energy profiles for twist deformations and were performed on six DNA dodecamer duplexes to cover all 10 possible DNA basepair steps. The shape of the free energy curves was similar for all duplexes. The calculated twist deformability was in good agreement with experiment and showed only modest variation for the complete duplexes. However, the response of the various basepair steps on twist stress was highly nonuniform. In particular, pyrimidine/purine steps were much more flexible than purine/purine steps followed by purine/pyrimidine steps. It was also possible to extract correlations of twist changes and other helical as well as global parameters of the DNA molecules. Twist deformations were found to significantly alter the local as well as global shape of the DNA modulating the accessibility for proteins and other ligands. Severe untwisting of DNA below an average of 25° per basepair step resulted in the onset of a global structural transition with a significantly smaller twist at one end of the DNA compared to the other. Abstract | Full Text | PDF (485 kb) |
Copyright © 1999 The Biophysical Society. All rights reserved.
Biophysical Journal, Volume 77, Issue 5, 2366-2376, 1 November 1999
doi:10.1016/S0006-3495(99)77074-8
Biophysical Theory and Modeling
N. Bruant*, D. Flatters#, 1, R. Lavery# and D. Genest*,
, 
* Centre de Biophysique Moléculaire, CNRS UPR 4301 and the University of Orléans, 45071 Orléans, France
# Laboratoire de Biochimie Théorique, CNRS UPR 9080, Institut de Biologie Physico-Chimique, 75005 Paris, France
Address reprint requests to Dr. Daniel Genest, Centre de Biophysique Moléculaire, CNRS, rue Charles Sadron, 45071 Orleans cedex 2, France. Tel.: 33-238-25-55-93; Fax: 33-238-63-15-17.The flexibility and internal dynamics of DNA are believed to play a major role in specific protein-DNA recognition. However, while many experiments measuring rotational correlation times have provided evidence of DNA deformations covering nanosecond (Wahl et al,Genest and Wahl, 1978,Hogan and Jardetzsky, 1980,Millard et al) to millisecond (Leroy et al,Guéron et al) time scales, they have not provided an overall spatial description of the internal motions.
An alternative approach to such data is offered by molecular modeling techniques such as molecular mechanics (Lavery, 1994,Packer and Hunter, 1998), Monte Carlo (Zhurkin et al,Gabb et al) or molecular dynamics (MD) (Sprous et al,Feig and Pettitt, 1998) simulations. Dynamic methods are able to describe structural fluctuations and transitions at the atomic level, as a function of base sequence, while taking solvent and counterion effects explicitly into account. However, such calculations are very time consuming, and only short oligonucleotides (∼20–30 bp) can currently be simulated for periods of a few nanoseconds. For many purposes, such data are insufficient, and it is necessary to envisage models of larger DNA fragments and simulations covering much longer periods. On the other hand, it is clearly possible to obtain useful information on nucleic acids without always requiring atomically resolved detail. Both of these facts suggest the importance of developing lower resolution models.
At a mesoscopic level, different simplified physical models have already been developed, in which double-stranded DNA is treated as a long flexible filament, with the environment reduced to simple continuum or stochastic effects (Schlick, 1995,Olson, 1996,Lafontaine and Lavery, 1999). These models may be classed in two broad categories: continuous elastic rods (Barkley and Zimm, 1979,Schlick and Olson, 1992) or strings of beads (Allison and McCammon, 1984,Chirico and Langowski, 1994,Jian et al,Tan and Harvey, 1989). They implicitly correspond to freezing many degrees of freedom, but which degrees of freedom are frozen is a choice generally based on a priori hypotheses, which are not fully under control. As an example, recent work by Chirico and Langowski, 1994 uses a bead model in which each bead is chosen to correspond to a rigid 37-bp DNA segment. Such models are generally parameterized on the basis of a small number of experimental data, such as persistence length measurements or fluorescence anisotropy decay. However, these data are often imprecise and do not cover finer effects, most notably those related to base sequence. They are also only valid for relatively small deformations, which involve neither local nor global structural transitions. Despite these restrictions, bead and rod models of DNA are very important, at least because they allow long simulations on large systems with a very limited number of variables. Dynamic simulations on fragments with thousands of base pairs over millisecond time scales thus become possible.
The present paper is an attempt to establish a bridge between atomic and mesoscopic descriptions of the internal dynamics of DNA and, in particular, to make judicious choices for freezing degrees of freedom. The principle is to use MD trajectories to determine which groups of atoms move collectively and can thus be frozen and employed in a well-founded rigid-body model of DNA.
We have recently developed methods for detecting rigid-body motions within MD trajectories (Gaudin et al,Hery at al), which also allow us to study what effects neglecting the residual deformations of these bodies have on the internal dynamics of a molecule (Genest, 1996,Genest, 1998,Briki and Genest, 1995). These methods have been tested on two different double-stranded DNA sequences, an octanucleotide and a dodecanucleotide. It was found that each nucleotide could be satisfactorily modeled by three rigid bodies: the base, the sugar ring, and an extended phosphate group (PO4+C5′). However, the generality of these results was limited because 1) the MD simulations were short (200–250ps), 2) they were restricted to fluctuations around the B-DNA conformation, and 3) they were limited to a single force field (GROMOS 87; van Gunsteren and Berendsen, 1986). It was also noted that although only one set of rigid bodies was tested, other solutions were possible.
In this study, we present a much more thorough investigation for two new 15-bp sequences. The first one, GCGTATATAAAACGC, includes a strong binding site (TATATAA) for the TATA box binding protein (TBP), and the second one, GCGTAAAAAAAACGC (with two T → A mutations underlined), includes a weak binding site. Each sequence was simulated twice, using different initial structures, and each simulation lasted 1ns (Flatters et al., 1998; Flatters and Lavery, 1998). The simulations were performed with AMBER, using the Parm94 force field (Pearlman et al,Cornell et al), and with particle mesh Ewald summations to avoid short-range electrostatic cutoffs (Darden et al,Cheatham et al). During these simulations A ↔ B transitions occurred for the central segment of the oligonucleotides.
This much more extensive data set allows us to overcome the restrictions applying to our earlier analysis and to define a hierarchy of possible rigid body models with known accuracy. We use one of the resulting models to construct a bead representation of the oligomers studied and to extract the torsional, bending, and stretching rigidities, which can be compared with experiment. We also use the sequence differences between the two oligomers to look at their impact on the rigid-body models.
Four 1-ns trajectories are analyzed, corresponding to two different 15-bp DNA oligonucleotides (1: GCGTATATAAAACGC, 2: GCGTAAAAAAAACGC), each simulated twice with different initial configurations: 1) all sugars have B-like C2′-endo puckers and 2) with A-like C3′-endo puckers for eight central base pairs (with the exception of the 3′-termini) in line with the conformations observed in the TBP-DNA complex (Kim et al). These two different initial conditions are referred to as B and BAB, respectively. An analysis of these four simulations, relating dynamic behavior to their known propensity for binding TBP, has recently been reported (Flatters et al,Flatters and Lavery, 1998).
All simulations were performed with the AMBER 4.1 program (Pearlman et al), using the Parm94 all-atom force field (Cornell et al). Each simulation was carried out in a box of ∼4500 water molecules, and electrostatic neutrality was achieved by adding 28 Na+ ions. Electrostatic interactions were treated with a particle mesh Ewald summation (Darden et al,Cheatham et al). A 2-fs time step was used in conjunction with SHAKE constraints (Ryckaert et al) on all bond lengths involving hydrogen atoms, and overall translations and rotations of DNA were removed. Configurations were stored every 0.5ps, leading to a total of 1900 configurations for each trajectory.
The principle of searching for quasirigid atom groups is based on an analysis of the matrix of the root mean square fluctuations (RMSs) of interatomic distances (limited in this analysis to all nonhydrogen atoms). Thus, two atoms, i and j, belong to the same rigid body if the RMS of separation rij (〈rij2〉1/2) is smaller than a fixed tolerance rc. Choosing rc makes it possible to build a Boolean decision matrix D for which each element Dij is 1 if 〈rij2〉1/2<rc and 0 otherwise. Different procedures can then be used to determine which atoms should be grouped (Gaudin et al). This leads to a reordered D matrix, which can be displayed graphically.
To maintain the notion of base sequence, we currently focus on groups for which all of the atoms belong to a single nucleotide or to a complementary nucleotide pair. For generality, we accept only those groups involving equivalent atoms for each nucleotide (or complementary nucleotide pair) within all of the trajectories studied.
Let {Fm} be a frame bound to the molecule that is invariant over the time course of a simulation, inasmuch as tumbling and translational motions are suppressed. Let us consider the first configuration of the trajectory, and let {Fg1} be the frame defined by the principal axes of a chosen group of atoms. The position of the center of mass and the orientation of the axes are defined relative to {Fm}. The first configuration of this group is now successively fitted to the other configurations k of this group, using the superposition method of McLachlan, 1979. This corresponds to a translation of the center of mass of the rigid body and a rotation defined by three Euler angles, moving {Fg1} into {Fgk}. The coordinates of each atom in the group at configuration k can then be computed with respect to {Fgk}, thereby defining the deformation of the group. Time series of these coordinates are also straightforward to calculate. We define the time series of the center of mass of a group within the molecular frame as
![]() | (1a) |
![]() | (1b) |
The corresponding root mean square fluctuations are given by
![]() |
![]() |
![]() | (2b) |
As explained in the previous paragraph, the motion of each group of atoms may be decomposed into a translation, a rotation, and a deformation. The approximation of rigid groups is fully valid only if the deformation of any group and the rigid-body motions of other groups are uncorrelated. A method for quantifying this has recently been described (Briki and Genest, 1994,Briki and Genest, 1995,Genest, 1996). Briefly, let n and m (we assume n≤m) be the number of coordinates describing the various motions of two groups composed of Na and N′a atoms, respectively (note, n=m=3 for translation and rotation, while n=3Na and m=3N′a for deformations). These two sets of coordinates can be considered as N-dimensional vectors (with N equal to the number of configurations) that define, respectively, n- and m-dimensional subspaces. The correlation between the two groups of vectors is related to the relative orientations of the two subspaces. Let R11 be the correlation matrix for the first set of vectors, let R22 be the correlation matrix for the second set, let R12 be the correlation matrix between vectors of the first group and the second group, and let R21 be the transpose of R12. We further define a square symmetrical positive definite matrix [R]=[R11]−1 · [R12] · [R22]−1 · [R21]. A canonical correlation coefficient can be defined that is related to the trace (Tr) of [R] by M={(1/n)Tr(R)}1/2 (Briki and Genest, 1994). In practice, the n and m vectors are not the actual coordinates, but their normalized deviation from the corresponding mean. If the components of the n vectors are taken at the same time as those of the m vectors for calculating the correlation matrices, one gets an equal time correlation coefficient, whereas if a time delay is introduced between the two sets of components, a time correlation function may be calculated as a function of the delay. Similarly, comparing one set of vectors with itself allows autocorrelation functions to be computed, while using two different sets of vectors leads to cross-correlation functions.
Let us consider two consecutive base pairs to be represented by two rigid beads linked by a virtual bond between their centers of mass C1 and C2. Let {u1, v1, w1} and {u2, v2, w2} be the principal axes of each bead calculated at the first step of the simulation. Owing to the shape of the base pairs, the two first axes of each bead lie roughly in the mean planes of the base pairs, while the third is perpendicular to these planes. As a consequence of rigid body motions, C1 and C2 translate and the axes rotate as a function of time. We define a local bending by the angle β between w1 and w2, a local twisting by the angle τ between the planes (C1C2, u1) and (C1C2, u2), and a local stretching by the distance l=|C1C2|. According to an earlier bead model (Chirico and Langowski, 1994,Klenin et al) the associated elastic potentials are
![]() | (3a) |
![]() | (3b) |
![]() | (3c) |
![]() | (4a) |
![]() | (4b) |
![]() | (4c) |
For S a global value has also been calculated according to two different procedures. In the first, l0 and 〈(Δl)2〉 in Eq. (4c) are related to the distance between the first and last beads of the sequence, while, in the second, they are related to the sum of the distances between consecutive beads, which is in fact the length of the DNA sequence.
Following the value chosen for the tolerance factor rc, different rigid-body models of DNA are generated. The largest root mean square fluctuation among the full set of interatomic distances is ∼0.45±0.05nm for each of the four trajectories. Setting rc to this value thus assimilates the entire oligomers as single rigid bodies. Decreasing rc to 0.15nm allows us to extract six rigid bodies, each composed of several nucleotides, either on the same strand or on different strands of the oligomers. However, these intermediate level rigid bodies are not well defined, as different results are obtained from the different trajectories studied. Decreasing rc to 0.1nm increases the number of rigid bodies but does not improve their definition.
We therefore turned to a systematic analysis of each nucleotide and of each complementary nucleotide pair, to look for general rules governing the definition of rigid groups of atoms. In this way, five different rigid-body models (see Fig. 1 and Table 1) can be constructed as a function of rc. For a very small rc (≤0.012nm), every nucleotide can be described by four rigid groups: the base, the sugar ring, the phosphate, and C5′ (model 1). When rc reaches 0.020nm, the C5′ atoms can be regrouped with the phosphate (model 2). For rc=0.030nm, paired bases can be grouped together, while the sugars and the PO4+C5′ groups remain separated, leading to a total of five rigid bodies for each complementary nucleotide pair (model 4). For rc=0.044nm, it is found that each sugar can be grouped with its associated base, so that each nucleotide is defined by only two rigid bodies (model 3). Finally, for rc=0.07nm the sugar-base groups of a nucleotide pair can be coalesced, leaving only two backbone PO4+C5′ groups (model 5). At this level, it is still impossible to group together all atoms belonging to two consecutive nucleotides. In addition, it is only possible to group together all of the atoms of any given nucleotide when rc>0.12nm.
For each MD trajectory, the time series of the center-of-mass coordinates and of the Euler angles defining the position and the orientation of each group frame {Fgk} (see Methods) were calculated with respect to the oligonucleotide-bound axis system {Fm}. These time series describe the rigid-body motions of each group within the molecule. The time series of the atomic coordinates relative to {Fgk} of each group were also computed, describing the internal deformation of the group during the simulation. It is therefore possible to compare the relative amplitudes of the rigid body and internal fluctuations of any given group for a particular trajectory. The amplitudes are defined by the RMS fluctuations of the center of mass and the mean atomic RMS fluctuation of the deformation (Eqs. (2a)). These RMS values, averaged over all equivalent groups, are presented in Table 2,Table 3 for each trajectory. The groups corresponding to single bases are excluded because of their very small deformations. It can be seen from these results that the groups composed of a single sugar ring or of an extended phosphate group (PO4+C5′) in models 2 and 4 undergo translational fluctuations that are at least an order of magnitude higher than their deformation fluctuations. For the other groups this ratio is reduced by at least a factor of 2.
| Table 2 Center-of-mass fluctuations RMSCM in nm (Eq. (2a)) of the different groups within the molecular reference frame |
| Models | 1, 2, 4 | 2, 3, 4, 5 | 3 | 4 | 5 | ||
|---|---|---|---|---|---|---|---|
| Groups | SU | SK | SU+BA | BA−BA | SU−BA … BA−SU | ||
| 1-B | 0.090 | 0.110 | 0.076 | 0.068 | 0.068 | ||
| 1-BAB | 0.073 | 0.096 | 0.061 | 0.055 | 0.070 | ||
| 2-B | 0.096 | 0.116 | 0.084 | 0.073 | 0.075 | ||
| 2-BAB | 0.078 | 0.098 | 0.066 | 0.060 | 0.060 | ||
| Table 3 Average atomic fluctuations RMSat in nm (Eq. (2b)) of the different groups within a group-bound reference frame |
| Models | 1, 2, 4 | 2, 3, 4, 5 | 3 | 4 | 5 | ||
|---|---|---|---|---|---|---|---|
| Groups | SU | SK | SU+BA | BA−BA | SU−BA … BA−SU | ||
| 1-B | 0.0062 | 0.0048 | 0.011 | 0.013 | 0.020 | ||
| 1-BAB | 0.0067 | 0.0047 | 0.011 | 0.014 | 0.021 | ||
| 2-B | 0.0055 | 0.0048 | 0.010 | 0.012 | 0.017 | ||
| 2-BAB | 0.0073 | 0.0049 | 0.013 | 0.013 | 0.019 | ||
In the case of models 2 and 4, which have particularly small deformations, we have computed the time series of the mean atomic position in the frame bound to the groups (Eq. (1b)) and of their center of mass within the molecular frame (Eq. (1a)). Typical examples are given in Fig. 2. Two types of behavior are observed. Some time series show a few distinct conformational transitions, while others exhibit fluctuations around an average conformation or position. This second type of time series was Fourier transformed, leading to the data shown in Fig. 3. It is seen that low frequencies are only strongly represented in the position spectrum.
To avoid end effects, only the nine central base pairs of the oligomers have been analyzed, and comparisons are limited to models 2 and 4 (the minor deformations of the bases in model 2 are ignored). As mentioned above, the time series of certain groups exhibits a small number of transitions that do not allow statistically accurate correlation coefficients to be obtained. In such cases it is still possible to detect correlations qualitatively by a visual comparison of the corresponding time series. An example is given in Fig. 4 for the translational motion of two adjacent sugar rings. However, such cases concern less than 30% of all nucleotides.
We first note that the equal time translational correlation between different groups is large. The correlation decreases only slowly as the distance between the groups increases. An average value of 0.88 is obtained for covalently linked groups, against 0.50 for groups separated by six nucleotides. Rotational motions show a similar behavior, although the values are significantly lower (0.50 for linked groups and 0.25 for distant groups).
In sharp contrast, there is very little correlation between the deformations of different groups, whatever their separation (∼0.15). These findings are in good agreement with previous studies (Briki and Genest, 1995,Genest, 1996). No significant differences are found between nucleotides, between the two oligonucleotide sequence, or between the trajectories.
It is also interesting to note that the deformations of the sugars or of the extended phosphate groups are only very weakly correlated with the translational and rotational motions of other groups (< 0.25) and consequently have no significant effect on the rigid-body motions of other groups. This result holds even between a sugar and its associated base (model 2) or base pair (model 4).
A somewhat different result is found for the equal time correlation between the deformation of a sugar or of an extended phosphate group and its own rigid-body motions. Although no effect is seen on translation, a correlation of ∼0.4–0.5 is found between deformation and rotation. Equal time correlation coefficients between the deformation of a base pair in model 4 and rigid-body motions of other base pairs also reveal a small correlation (0.25–0.33), but a problem of accuracy may occur in this case (see below).
Examples of canonical cross-correlation functions between group deformations and the deformation or rigid-body motions of other groups are shown over 100ps in Fig. 5. It can be seen that a given group deformation has no effect on either the deformations or the rigid-body motions of other groups, even after a delay of 100ps. This is true for any sugar or extended phosphate groups of models 2 and 4. However, the rigid-body motions of different groups are found to be correlated over a time period of ∼40ps (Fig. 6).
This analysis has been performed with model 4 to formally reproduce the bead models used by other authors (Allison and McCammon, 1984,Chirico and Langowski, 1994,Klenin et al). It is again limited to the nine central base pairs to avoid oligomeric end effects. Each base pair is assumed to be a single rigid bead. The sugar and extended phosphate groups play the role of springs linking the beads. The configuration averaged values and the corresponding RMS fluctuations of local twist and bending angles and of the distances between consecutive beads were calculated for each base pair and each trajectory. Torsional, bending, and stretching rigidities were then evaluated. Although some differences are observed between beads, it is not possible to assign these to sequence or trajectory-linked effects in any reliable way. The bead-averaged values are given in Table 4. These values are on the same order of magnitude as those obtained experimentally, with the exception of the stretching rigidity (mean value=3760 pN), which is significantly higher than values given by Cluzel et al and by Smith et al, which are on the order of 900-1100 pN.
| Table 4 Stretching (S), bending (B), and torsional (C) rigidities of the bead model calculated from model 4 |
| Property | 1-B | 1-BAB | 2-B | 2-BAB | ||
|---|---|---|---|---|---|---|
| l0 (nm) | 0.361 | 0.390 | 0.366 | 0.362 | ||
| 〈Δl2〉 (nm2) | 5.5×10−4 | 4.5×10−4 | 5.8×10−4 | 3.7×10−4 | ||
| S (pN) | 3605±1486 | 4022±1630 | 2746±756 | 4007±456 | ||
| 〈Δβ2〉 (rad2) | 7.92×10−3 | 6.73×10−3 | 6.43×10−3 | 8.13×10−3 | ||
| B (×10−19 erg · cm) | 4.5±1.9 | 5.4±2.0 | 5.2±1.7 | 4.9±2.3 | ||
| 〈Δγ2〉 (rad2) | 15.00×10−3 | 7.23×10−3 | 11.00×10−3 | 6.79×10−3 | ||
| C (×10−19 erg · cm) | 1.20±0.55 | 2.65±0.82 | 1.53±0.44 | 2.78±1.0 | ||
| l0 is the average (over the configurations and over the beads) distance between the centers of mass of two consecutive base pairs, and 〈Δl2〉 is the corresponding RMS fluctuation (nm). 〈Δβ2〉 and 〈Δγ2〉 are, respectively, the equivalent average RMS fluctuations of the bending and torsion angles between two base pairs. |
Because experimental measurements of DNA stretching give global information on long, single molecules and not local information at the level of individual base pairs, we have used two other procedures for calculating a more macroscopic S value (see Methods). In the first, the average distance and the corresponding RMS fluctuations are measured between the two separated base pairs A4 and T12, while in the second, the sum of the interbead distances lying between A4 and T12 is calculated for each conformation of the simulation, and its average and RMS fluctuations are used to determine S. We find that these two procedures lead to a decrease in S, by a factor of 2.5 in the first case (mean value 1530 pN) and by a factor of 1.7 in the second case (mean value 2260 pN).
This study provides a quantitative justification for five different rigid-body models of DNA. The models that represent each nucleotide or complementary nucleotide pair by a few beads are valid for oligomers with different sequences and for both A- and B-like helical conformations. These beads are closely related to chemical entities within the DNA molecule and can be expected to be valid for other base sequences. Each model has a defined degree of accuracy based on interatomic distance fluctuations. Models 1 and 2 defined here are identical to those reported earlier on the basis of two short MD simulations (200–250ps) limited to B-DNA (Gaudin et al) and using a different force field. The present work increases the generality of these definitions and lays a firm foundation for the formulation of mesoscopic models of DNA, lying in the relatively unexplored area between atomic and elastic rod representations, and enabling sequence-dependent effects to be maintained.
The rigid-body models we define can be divided into two categories: 1) models 1, 2 and 3, where beads contain atoms belonging to a single nucleotide, and 2) models 4 and 5, where certain beads contain atoms from complementary nucleotides. The precision of the models decreases in the order 1, 2, 4, 3, 5. In all models, the bases, the sugar, and the phosphate group can be treated as rigid. Although it is possible in some models to assimilate a sugar and a base or a base pair within a single rigid body, the phosphate group bead cannot be extended beyond the adjacent C5′ atom.
For models 2 and 4 we have used canonical correlation analysis to determine whether freezing internal degrees of freedom within a bead can influence the rigid-body motions. These two models share backbones divided into sugars and extended phosphate groups. We have been able to show that the deformation of these beads is uncorrelated with the dynamics of the other beads (including between a sugar and the associated base) or with the bead's own position. However, there is correlation with the bead's orientation, leading to indirect correlations with the motions of other beads. Consequently, in our mesoscopic DNA representation, sugar or extended phosphate deformations will become rotations during the fitting process and implicitly affect the dynamics of other beads.
In contrast to the backbone beads, base deformations in models 1 and 2 are too small to have significant effects. With base pair beads (model 4), however, we have seen a moderate correlation (0.25–0.33) with the rigid-body motions of other base pairs. However, in this case we need to question the accuracy of the calculations. The number of variables n describing internal dynamics is three times higher for an AT base pair (57 variables, without hydrogens) than for an extended phosphate group (18 variables), and the number of available conformations N is only 1900 for each trajectory. Increasing the number of variables strongly affects the accuracy of the correlation matrix elements, which is not compensated for by an increase in sampling. Although the uncertainty of the canonical correlation values is difficult to estimate, it certainly does not scale far from [n/N]1/2 (Girshick, 1939).
The most appropriate model for subsequent studies will depend on the property to be investigated. It is clear that none of the models proposed can be used for structural determination from NMR or x-ray data, because this requires atomic resolution. It should also be noted that our models do not resolve sugar ring puckering. However, we can specify the generic use of the various mesoscopic representations:
Model 2 (and, a fortiori, model 1) is appropriate for analyzing the dynamic behavior of double-stranded DNA sequences, using the inter- and intrabase or base pair helical parameters defined at the EMBO meeting in Cambridge (Dickerson et al). These parameters describe the orientation and the position of single bases or of complementary base pairs. This information is conserved in model 2 because we have shown that no correlation exists between the deformation of the extended phosphate or of the sugar puckering conformations and the rigid-body motions of the bases. Thus using three rigid objects per nucleotide in place of an all-atom representation should not cause any loss of accuracy in the determination of the helical parameters. It should be noted that model 2 (or model 1) should be applicable to both single- and triple-stranded helices and that, provided the effective force fields are appropriate, nothing opposes base pair disruption in these models.
Model 4 is more restrictive, as it implies that complementary bases always remain hydrogen bonded. This is a valid assumption for the simulation of double-stranded DNA sequences when the relative motions of individual bases within a pair can be neglected and could reasonably apply to long DNA fragments at room temperature for periods in the nanosecond range. This model can still be used for studying interbase pair parameters, because the orientation and position of individual base pairs are, at most, very weakly coupled to any other rigid bodies, but its use naturally implies that intra-base pair parameters can no longer be monitored.
Models 3 and 5 correspond to rougher approximations. Although we have not calculated the correlation between the internal deformations of the corresponding rigid bodies and their translation and rotation (because the ratio of the number of configurations to the number of variables is too small), one can expect that as the distance between two rigid bodies increases, this correlation decreases. Therefore it is certainly possible to use these models for simplifying the calculation of long-range interactions. This also implies force-field simplifications, such as the use of multipole expansions for treating the electrostatic interactions between distance beads.
It is finally remarked that it is perfectly feasible to combine several of these models in a “reaction center” approach, using either an all-atom representation or the more refined bead models (1, 2, or 4) at the point of interest, combined with more approximate representations (models 3 or 5) and correspondingly simplified force fields (harmonic?) for more distant nucleotides.
The quality of simulations using a bead model depends both on the validity of freezing internal degrees of freedom of the beads and on the treatment of the interbead interactions. A number of authors have used the notion of beads to study long DNA filaments, especially the hydrodynamics of supercoiled DNA (Tan and Harvey, 1989,Chirico and Langowski, 1994,Jian et al,Klenin et al). However, most authors used a priori defined beads, making it difficult to predict whether the associated errors can be safely ignored for a specific application (although some of these models correspond to our results, as, for example, in the case of the Tan and Harvey, 1989 base plane approach and our model 4). Our study shows how this difficulty can be overcome. In the discussion above we are able to specify where models 1, 2, and 4 can be used safely, with no significant loss of accuracy in the determination of helical parameters due to the introduction of beads. In addition, and in contrast to nearly all present models, this approach makes it possible to conserve sequence-dependent features.
Interbead potentials remain to be calculated, but several general remarks can again be made. First, a simple, spherically symmetrical, united-atom model for each bead is certainly not appropriate, because it excludes the anisotropic effects due to shape and to charge distribution. Similarly, a refined treatment of the interaction forces and torques between two linked rigid bodies cannot be reduced to harmonic potentials. We return to this point shortly.
By extracting data corresponding to the base pair bead model 4 from our all-atom dynamics, we have calculated the local twisting, bending, and stretching rigidities of DNA. Experimental estimations of rigidities are roughly 1–4×10−19 erg·cm (Barkley and Zimm, 1979,Thomas et al,Millard et al) for twisting and 2–3×10−19 erg·cm (Barkley and Zimm, 1979,Millard et al) for bending. Our values are in reasonable agreement with these results. In contrast, the stretching rigidity resulting from the experiments of Cluzel et al and Smith et al, ∼1100 pN, is more than three times smaller than the average value we obtain at the level of individual base pairs (3760 pN). However, it should not be forgotten that the experimental value refers to the overall stretching of long DNA polymers, while our computation refers to an average local value between two successive base pairs. We examined this point by considering more distant base pairs for computing S with Eq. (4c). By using either the distance between the first and the last base pairs analyzed or the sum of the interbead distances (a better estimate of the length of the DNA filament), we obtain a significant decrease in S (1530–2260pN). If we had studied a longer DNA fragment we could reasonably hope to approach the experimental result. The important point is that this decrease clearly reflects a negative correlation between the stretching of successive interbead distances. This correlation will be totally absent in a priori bead models that are parameterized at the interbead level to reproduce macroscopic elastic properties. The consequences of such approximations, like those related to the arbitrary choice of beads, are again difficult to predict.
We intend to use the rigorous bead definitions set out here for future studies of DNA's elastic and hydrodynamic properties. For this, model 4 should be appropriate and already represents a 6.5 times reduction in the number of variables compared to an all-atom simulation. Force-field development, including sequence effects, will certainly require denser sampling of all-atom trajectories. Long simulations are also important, as shown by the small delayed correlation between bead deformation and the rigid-body motions of other beads observed on the basis of short simulations (Genest, 1996), but they are absent here. The resulting models will be appropriate for energy minimization and Monte Carlo simulations and can be extended to dynamics studies by the addition of viscous solvent effects.
Allison and McCammon, 1984 (1984). Multistep Brownian dynamics: application to short wormlike chain. Biopolymers 23, 363–375. CrossRef | PubMed
Barkley and Zimm, 1979 (1979). Theory of twisting and bending of chain macromolecules: analysis of the fluorescence depolarization of DNA. J. Chem. Phys. 70, 2991–3007. CrossRef | PubMed
Briki and Genest, 1994 (1994). Canonical analysis of correlated atomic motions in DNA from molecular dynamics simulation. Biophys. Chem. 52, 35–43. CrossRef | PubMed
Briki and Genest, 1995 (1995). Rigid body motions of sub-units in DNA: a correlation analysis of a 200ps molecular dynamics simulation. J. Biomol. Struct. Dyn. 12, 1063–1082. PubMed
Cheatham et al., 1995 (1995). Molecular dynamics simulations on solvated biomolecular systems: the particle mesh Ewald method leads to stable trajectories of DNA, RNA and proteins. J. Am. Chem. Soc. 117, 4193–4194. CrossRef | PubMed
Chirico and Langowski, 1994 (1994). Kinetics of DNA supercoiling studied by Brownian dynamics simulation. Biopolymers 34, 415–433. CrossRef | PubMed
Cluzel et al., 1996 (1996). DNA: an extensible molecule. Science 271, 792–794. PubMed
Cornell et al., 1995 (1995). A second generation force field for the simulation of proteins, nucleic acids and organic molecules. J. Am. Chem. Soc. 117, 5179–5197. CrossRef | PubMed
Darden et al., 1993 (1993). Particle mesh Ewald: an N·log(N) method for Ewald sums in large systems. J. Chem. Phys. 98, 10089–10092. CrossRef | PubMed
Dickerson et al., 1989 (1989). Definitions and nomenclature of nucleic acid structure parameters. J. Mol. Biol. 205, 787–791. CrossRef | PubMed
Feig and Pettitt, 1998 (1998). Structural equilibrium of DNA represented with different force fields. Biophys. J. 75, 134–149. Abstract | Full Text | PDF (542 kb) | PubMed
Flatters and Lavery, 1998 (1998). Sequence-dependent dynamics of TATA-box binding sites. Biophys. J. 75, 372–381. Abstract | Full Text | PDF (500 kb) | PubMed
Flatters et al., 1997 (1997). Conformational properties of the TATA-box binding sequence of DNA. J. Biomol. Struct. Dyn. 14, 757–765. PubMed
Gabb et al., 1997 (1997). Collective-variable Monte Carlo simulation of DNA. J. Comp. Chem. 18, 2001–2011. PubMed
Gaudin et al., 1997 (1997). Search for rigid subdomains in DNA from molecular dynamics simulations. J. Biomol. Struct. Dyn. 15, 357–367. PubMed
Genest, 1996 (1996). How long does DNA keep the memory of its conformation? A time-dependent canonical correlation analysis of molecular dynamics simulation. Biopolymers 38, 389–399. CrossRef | PubMed
Genest, 1998 (1998). Motion of groups of atoms in DNA studied by molecular dynamics simulation. Eur. Biophys. J. 27, 283–289. CrossRef | PubMed
Genest and Wahl, 1978 (1978). Fluorescence anisotropy decay due to rotational Brownian motion of ethidium intercalated in double strand DNA. Biochim. Biophys. Acta 52, 502–509. CrossRef | PubMed
Girshick, 1939 (1939). On the sampling theory of roots of determinantal equations. Ann. Math. Stat. 10, 203–224. PubMed
Guéron et al., 1987 (1987). A single mode of base pair opening drives imino proton exchange. Nature 328, 89–92. CrossRef | PubMed
Hery at al., 1998 (1998). X-ray diffuse scattering and rigid-body motions in crystalline lysozyme probed by molecular dynamics simulation. J. Mol. Biol. 279, 303–319. CrossRef | PubMed
Hogan and Jardetzsky, 1980 (1980). Internal motions in deoxyribonucleic acid II. Biochemistry 19, 3460–3468. PubMed
Jian et al., 1998 (1998). Internal motion of supercoiled DNA: Brownian dynamics simulations of site juxtaposition. J. Mol. Biol. 284, 287–296. CrossRef | PubMed
Kim et al., 1993 (1993). Crystal structure of a yeast TBP/TATA-box complex. Nature 365, 512–520. CrossRef | PubMed
Klenin et al., 1998 (1998). A Brownian dynamics program for the simulation of linear and circular DNA and other wormlike chain polyelectrolytes. Biophys. J. 74, 780–788. Abstract | Full Text | PDF (227 kb) | PubMed
Lafontaine and Lavery, 1999 Lafontaine, I., and R. Lavery. 1999. Reduced coordinate models of nucleic acids. Curr. Opin. Struct. Biol. (in press)..
Lavery, 1994 (1994). Modeling nucleic acids: fine structure, flexibility and conformational transitions. Adv. Comput. Biol. 1, 69–145. PubMed
Leroy et al., 1985 (1985). Proton exchange and base pair opening of poly(rA)·poly(rU) and poly(rI)·poly(rC). J. Mol. Biol. 184, 165–168. CrossRef | PubMed
McLachlan, 1979 (1979). Gene duplication in the structural evolution of chymotrypsin. J. Mol. Biol. 128, 49–79. CrossRef | PubMed
Millard et al., 1988 (1988). Modification of DNA dynamics by platinum drug binding: a time-dependent fluorescence depolarization study of the interaction of cis- and trans- diamminedichloroplatinum(II) with DNA. Biochemistry 27, 8599–8606. PubMed
Olson, 1996 (1996). Simulating DNA at low resolution. Curr. Opin. Struct. Biol. 6, 242–256. CrossRef | PubMed
Packer and Hunter, 1998 (1998). Sequence dependent DNA structure: the role of the sugar-phosphate backbone. J. Mol. Biol. 280, 407–420. CrossRef | PubMed
Pearlman et al., 1995 In AMBER 4.1. Pearlman, D.A., Case, D.A., Caldwell, G.W., Ross, W.S., Cheatham, T.E., Ferguson, D.M., Seibel, G.L., Singh, U.C., Weiner, P., Kollman, P.A., eds. (San Francisco, CA: University of California at San Francisco). PubMed
Ryckaert et al., 1977 (1977). Numerical integration of Cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. J. Comp. Physiol. 23, 327–341. PubMed
Schlick, 1995 (1995). Modeling superhelical DNA: recent analytical and dynamic approaches. Curr. Opin. Struct. Biol. 5, 145–262. PubMed
Schlick and Olson, 1992 (1992). Supercoiled DNA energetics and dynamics by computer simulation. J. Mol. Biol. 223, 1089–1119. CrossRef | PubMed
Smith et al., 1996 (1996). Overstretching DNA: the elastic response of individual double-stranded and single-stranded DNA molecules. Science 271, 795–799. PubMed
Sprous et al., 1998 (1998). Molecular dynamics studies of the conformational preferences of a DNA double helix in water and an ethanol/water mixture. Theoretical considerations of the A to and from B transition. J. Phys. Chem. 102, 4658–4667. PubMed
Tan and Harvey, 1989 (1989). Molecular mechanics model of supercoiled DNA. J. Mol. Biol. 205, 573–591. CrossRef | PubMed
Thomas et al., 1980 (1980). Torsion dynamics and depolarization of fluorescence of linear macromolecules. II. Fluorescence polarization anisotropy measurements on a clean viral ϕ29 DNA. Biophys. Chem. 12, 177–188. CrossRef | PubMed
van Gunsteren and Berendsen, 1986 (1986). GROMOS 87 package, Biomos. Biomolecular Software. (Groningen, the Netherlands: University of Groningen). PubMed
Wahl et al., 1970 (1970). Decay of fluorescence emission anisotropy of the ethidium bromide-DNA complex. Evidence for an internal motion in DNA. Proc. Natl. Acad. Sci. USA 65, 417–421. CrossRef | PubMed
Zhurkin et al., 1991 (1991). Static and statistical bending of DNA evaluated by Monte Carlo simulations. Proc. Natl. Acad. Sci. USA 88, 7046–7050. CrossRef | PubMed