| Cumulant Analysis in Fluorescence Fluctuation Spectroscopy Biophysical Journal, Volume 86, Issue 6, 1 June 2004, Pages 3981-3992 Joachim D. Müller Abstract A novel technique for the analysis of fluorescence fluctuation experiments is introduced. Fluorescence cumulant analysis (FCA) exploits the factorial cumulants of the photon counts and resolves heterogeneous samples based on differences in brightness. A simple analytical model connects the cumulants of the photon counts with the brightness and the number of molecules in the optical observation volume for each fluorescent species. To provide the tools for a rigorous error analysis of FCA, expressions for the variance of factorial cumulants are developed and tested. We compare theory with experiment by analyzing dye mixtures and simple fluorophore solutions with FCA. A comparison of FCA with photon-counting histogram (PCH) analysis, a related technique, shows that both methods give identical results within experimental uncertainty. Both FCA and PCH are restricted to data sampling times that are short compared to the diffusion time of molecules through the observation volume of the instrument. But FCA theory, in contrast to PCH, can be extended to treat arbitrary sampling times. Here, we derive analytical expressions for the second factorial cumulant as a function of the sampling time and demonstrate that the theory successfully models fluorescence fluctuation data. Abstract | Full Text | PDF (196 kb) |
| The Kinetics of Phase Separation in Asymmetric Membranes Biophysical Journal, Volume 88, Issue 6, 1 June 2005, Pages 4072-4083 Elizabeth J. Wallace, Nigel M. Hooper and Peter D. Olmsted Abstract Phase separation in a model asymmetric membrane is studied using Monte Carlo techniques. The membrane comprises two species of particles, which mimic different lipids in lipid bilayers and separately possess either zero or non-zero spontaneous curvatures. We study the influence of phase separation on membrane shape and the influence of the coupling of composition and height dynamics on phase separation and domain growth, via both the degree of shape asymmetry and relative kinetic coefficients for height relaxation. Abstract | Full Text | PDF (219 kb) |
| Parameter Inference for Biochemical Systems that Undergo a Hopf Bifurcation Biophysical Journal, Volume 95, Issue 2, 15 July 2008, Pages 540-549 Paul D.W. Kirk, Tina Toni and Michael P.H. Stumpf Abstract The increasingly widespread use of parametric mathematical models to describe biological systems means that the ability to infer model parameters is of great importance. In this study, we consider parameter inferability in nonlinear ordinary differential equation models that undergo a bifurcation, focusing on a simple but generic biochemical reaction model. We systematically investigate the shape of the likelihood function for the model's parameters, analyzing the changes that occur as the model undergoes a Hopf bifurcation. We demonstrate that there exists an intrinsic link between inference and the parameters’ impact on the modeled system's dynamical stability, which we hope will motivate further research in this area. Abstract | Full Text | PDF (594 kb) |
Copyright © 2007 The Biophysical Society. All rights reserved.
Biophysical Journal, Volume 92, Issue 12, 4168-4178, 15 June 2007
doi:10.1529/biophysj.106.092650
Biophysical Theory and Modeling
Darren B. VanBeek, Matthew C. Zwier, Justin M. Shorb and Brent P. Krueger
, 
Hope College, Department of Chemistry, Holland, Michigan
Address reprint requests to B. P. Krueger, Tel.: 616-395-7629.Fluorescence-detected resonance energy transfer (FRET) has been an important tool in structural biology for more than three decades 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19. Recently, with the availability of single-molecule microscopes and a wide array of techniques for fluorescently labeling proteins and nucleic acids both in vitro and in vivo, the number of studies that utilize FRET has expanded dramatically (Google scholar, in 2005, yielded more than 2000 references that included “FRET”). While FRET is now being applied to a tremendous variety of systems utilizing an array of experimental techniques, many of which could only be imagined while FRET was maturing, most modern FRET studies still make use of the same 30+-year-old approximations, often with little consideration of their applicability to the particular system of study. Here, we present computational examination of the correlation between orientation and distance (κ and RDA, respectively, see below) between the two fluorescent probes. The independence of these two factors has been assumed in the vast majority of FRET studies, yet, to the author's knowledge, has been examined in only two articles 20,21 and even mentioned in only four others 1,22,23,24. This independence approximation clearly breaks down in the chosen system and the physical cause of the breakdown appears to be a quite general result of the commonly-used succinimide linkage between fluorescent label and protein.
In FRET studies, a fluorescence observable is used to determine the rate of resonant energy transfer (kRET) from an energy donor (D) to an energy acceptor (A), where D and A are typically fluorescent moieties that may be native to the system, added chemically, added genetically, or through some combination. The RET rate is often measured through time-resolved experiments that give kRET more-or-less directly or through steady-state experiments that give kRET indirectly via the efficiency of D to A energy transfer relative to D fluorescence. This rate can be related to the distance between the D and A, RDA, through theory developed by Förster in the 1940s 25,26,27. The Förster equation appears in many forms 1,2,25,28 such as
![]() | (1) |
and the absorption spectrum of the A,
on a wavenumber scale. Note that the values of ϕD, τD, and JDA have been tabulated for many D-A pairs 28 and are generally taken to be independent of any structural dynamics in the system such that![]() | (2) |
If one assumes a particular value for κ2 (often 2/3, see below) then the measured kRET can be related to the valuable structural parameter, RDA. Since the locations of the D and A on the system of interest are usually well-known, one has now learned something about the structure of the protein or nucleic acid under investigation. Or, if the D and A are attached to two different bodies, one has an easily-observed assay of whether those two bodies are bound or not, allowing determination of, e.g., the ΔG of binding.
In the 1970s and 80s, pioneering researchers convincingly showed the broad applicability of a number of approximations that proved useful in analysis of FRET data 3,4,5,29,30,31,32,33. Most famous of these is the “κ2 approximation” mentioned above. The value κ is defined as
![]() | (3) |
and
are unit vectors along the transition dipoles of the D and A, respectively, and
is the unit vector along the line connecting the centers of the D and A. If the D and A are free to independently sample all possible orientations, the average value of κ2 can be determined analytically to be 2/3. The validity of this isotropic limit was the focus of many early FRET articles and a number of methods were developed to reduce the uncertainty in RDA resulting from uncertainties in 〈κ2〉 5,31,32,34,35,36,37,38,39. More recently, associated anisotropy measurements have been used to verify whether a distribution in FRET efficiencies results from a distribution of distances or a distribution in orientational mobility of the D 19,40. Despite these efforts, most experiments to date have simply assumed the value of 2/3 and their general success suggests that under many common conditions the difference between the actual 〈κ2〉 and 2/3 leads to modest errors in determining RDA18,28,37,41.However, by substituting the average value of κ2 into Eq. (2), every researcher that has thoughtfully (or blindly) employed the κ2 approximation has implicitly also assumed that κ2 is independent of RDA, i.e., that
![]() | (4) |
More recently, a number of groups have used molecular dynamics (MD) simulations to aid understanding of FRET experiments 23,24,42,43,44,45,46,47,48. Many of these works 23,24,42,46,47,48 examined 〈κ2〉 and found varying degrees of agreement between their simulations and the isotropic limit. Four of these works 23,24,42,47 carefully evaluated the electronic coupling between donor and acceptor at every snapshot and, therefore, did properly account for correlation between κ and RDA, though only two actually mentioned the possibility 23,24 and neither treated it in detail.
While the success of many careful FRET experiments suggests that κ and RDA must be reasonably independent in most cases, it is important to note that for most of its history of use in biology, FRET has been applied to proteins that are small, rigid, and soluble. In contemporary studies, FRET is being applied to systems of remarkable diversity including soluble proteins, membrane proteins, and nucleic acids of almost every conceivable shape and size as well as large protein-protein and protein-nucleic acid complexes. Many of these systems are also far from rigid, undergoing large-scale structural changes. Since κ and RDA are both structural parameters describing similar molecules in similar environments, it is reasonable to expect that there is a variety of systems in which D-A relative orientation and D-A distance show some degree of correlation. Thus, it is prudent to examine the effect such correlation might have on FRET results in a variety of systems. We begin to explore this arena here, presenting the results of one such examination using the archetypal protein, hen egg-white lysozyme (HEWL).
In the system we have chosen to study (shown in Fig. 1), the D (7-dimethylamino-4-methylcoumarin, i.e., DACM) is attached to HEWL via a succinimide linkage as in typical maleimide conjugation chemistry. The A (Eosin Y, i.e., eosin) binds noncovalently, meaning that its orientation is essentially fixed relative to the protein 49,50,51,52,53. While this is an unusual arrangement compared to classic FRET studies in which both D and A are on flexible linkages, common use of intrinsic probes (e.g., NADP) and the growing popularity of the FlAsH family of fluorescent labels 54 as well as GFP and related fluorophores makes this a system of broad practical interest for contemporary FRET studies, in addition to providing a useful test of the independence of κ and RDA.
The computational methods employed here are described below followed by detailed analysis of the behavior of κ and RDA. Significant correlation between κ and RDA is observed as well as significant deviation of 〈κ2〉 from the theoretically predicted value. The physical basis for the failure of the κ2 approximation and the independence approximation is discussed as well as the impact of these results on FRET experiments.
Because the fluorescent labels represent residues not present in the standard AMBER force fields, their parameterization was required. Atom types for both molecules were chosen through analogy to the generalized AMBER force field (GAFF) 55, which is designed to be compatible with the Cornell et al. force field 56. Normal mode calculations performed using the nmode package of the AMBER 8 suite 57 (University of California, San Francisco, CA) revealed generally good agreement with vibrational modes calculated quantum mechanically (QM) at the DFT(B3-LYP)/6-31G* level and scaled by 0.96 58. All QM calculations were performed using the 1998 version of the Gaussian suite 59 (Gaussian, Pittsburgh, PA). However, the molecular mechanics description of the out-of-plane bending motions of the heterocyclic oxygen present in both molecules were found to be highly exaggerated relative to the QM calculations—amplitudes too high and frequencies too low. Reviewing the parameters revealed that the x-CA-OS-x dihedral parameter in GAFF (not present in Cornell et al.) had a force constant (Vn/2) of 1.8kcal/mol, similar to that between tetrahedral carbons. However, in both dye molecules the involved atoms are part of the aromatic ring system such that we might expect a force constant similar to that found for a six-membered ring of aromatic carbon atoms, or 14.5kcal/mol for both GAFF and Cornell et al. Adjusting the force constant and repeating the normal mode calculations revealed the best agreement with scaled QM frequencies for a force constant of ≈18 kcal/mol, similar to the aromatic carbon force constant. A new atom type was defined—aromatic oxygen, OA—to represent the heterocyclic oxygen present in the aromatic ring systems of many fluorescent probes. For generality and consistency with the force field of Cornell et al., the x-CA-OA-x force constant was assigned a value of 14.5 kcal/mol. All other force constants were assumed to be the same as the OS atom type in GAFF. Charges for both molecules were determined using the RESP package of AMBER 8, which involves restrained fitting to an electrostatic potential derived quantum-mechanically at the HF/6-31G* level 60,61. Structures, assigned atom types, and partial charges for DACM and eosin are provided as Supplementary Material .
The starting structure of HEWL was taken from the microgravity crystal structure (PDB ID: 1BWH) 62 with all waters removed. Residue 47 was changed from threonine to cysteine and bonded to the dye DACM 63,64 via a succinimide group as shown in Fig. 2. This structure was minimized to remove bad Van der Waals contacts. Eosin was placed near the binding site using the xLeap package within the AMBER suite and restraints were invoked to pull the eosin into the binding site during a short MD run. The restraints were chosen to encourage the eosin to bind as deduced from multiple spectroscopic studies and shown in Jordanides et al. 49. The system was then neutralized by adding 5 Cl− ions and solvated with a 20Å buffer of explicit water for a total of 17,712 water molecules (total system size 55,149 atoms). The restraints were removed and all further MD was performed without any restraints.
All MD simulations were performed with the Sander package of AMBER 8 using the isothermal-isobaric (NPT) ensemble. A cutoff of 9Å was used for Van der Waals interactions and particle mesh Ewald 65,66 was used for electrostatic interactions. SHAKE was applied to all bonds with hydrogen 67. The system was equilibrated in four steps:
System equilibration was confirmed by noting that drifts in the total energy and density (as measured by the slopes of linear fits to the final 25ps of equilibration data) were<5% of the short-term fluctuations in each (as measured by the standard deviation). The diffusion of the chloride ions was also measured; all five continued to diffuse with similar diffusion constants through the simulation, confirming that none of the counterions interacted for long times with any specific protein sites.
A total of 37ns of production MD was generated and system coordinates were saved every 200 fs, yielding 185,000 snapshots constituting roughly 700 GB of total structural data. From each coordinate snapshot, the positions and orientations of the two dye molecules were retrieved. The centers of the fluorophores were defined by the mass-weighted average positions of the atoms determined to be involved in the transition. Atoms were judged to be involved in the transition by visual inspection of the transition densities determined at the CIS/6-31G* level. The CIS calculation also provides the orientation of the transition dipole for each dye. A pair of atoms was chosen such that the orientation of the vector between them most closely matched the orientation of the calculated transition dipole. The vector connecting these atoms was then used as a measure of the transition dipole orientation throughout the simulation. The angles between the QM-determined transition dipoles and the atomic approximations to the transition dipoles are 1.75° and 0.002° for DACM and eosin, respectively.
Note that the definition of the transition dipole direction of each probe comes from a single QM calculation performed on an optimized structure of each dye without solvent. This kind of analysis makes two assumptions about the D and A transition dipoles: 1) that the transitions between ground and excited states in both the D and A are each well described by a single transition dipole, fixed to the molecular framework; and 2) that that transition dipole for each probe is insensitive to fluctuations in its environment.
Regarding the first assumption, the Steinberg group pointed out that for a number of common fluorescent probes, the transitions between ground and excited states are better described as a mixture between multiple transitions and, therefore, should be described using multiple transition moments 68. This mixing is particularly important for naphthalene, the subject of the original Steinberg work, as well as the ubiquitous fluorescence probe tryptophan. As an example, the complex fluorescence behavior of tryptophan arises because two low-lying excited singlet states, 1La and 1Lb, are nearly degenerate and nearly orthogonal in their transition dipole orientations. Thus, at many excitation wavelengths, a mixture of the two states is generated, leading to complex photophysical behavior that is revealed through polarization spectroscopy 69,70,71,72. In contrast, the DACM and eosin used in this study exhibit simple excited-state behavior. Their fluorescence anisotropies are not significantly wavelength-dependent (data not shown) 73; their time-resolved anisotropies decay from an initial value near the theoretical limit of 0.4 52,73; and the S1 state responsible for absorption and emission is energetically well separated from other states (based on our CIS calculations and absorption/emission spectra). Thus, the assumption of a single, fixed transition dipole is well-justified for both DACM and eosin. In fact, many commonly employed probe molecules (e.g., coumarins, xanthenes, and AlexaFluor dyes) exhibit this ideal behavior 74,75,76 such that the analysis methods utilized here are reasonable for a large number of FRET systems.
The second assumption can be addressed computationally by evaluating the electronic structure of each probe molecule at each dynamics snapshot 47,77,78. This treatment allows fluctuations in both the transition dipole orientation and magnitude to be accounted for. However, this QM evaluation of every snapshot is computationally intensive, particularly for a trajectory of ∼40ns, and is beyond the scope of the present work.
While accurately representing the molecular transition moments of the D and A is important for modeling the RET behavior of this particular system, the general question of interest here centers on the relative motions of one free and one fixed probe in any system with any arbitrary preferred D-A orientation. Therefore, to more generally report on the consequences of having one free and one fixed probe, we should consider the possibility that the A might bind in any orientation relative to the protein. To address this broader question, we also kept track of an alternate transition dipole for eosin, roughly perpendicular to the actual transition dipole and still in the plane of the aromatic core of the molecule. This transition dipole will be referred to using a y subscript (e.g., the κ factor between this transition dipole and the DACM transition dipole is κy). Taking the cross product of these two vectors provides a second alternate transition dipole (z), such that the group of three approximately form a three-dimensional basis for appraising the general ramifications to FRET of having one fixed and one free probe.
The primary data retrieved from each snapshot of the MD simulation were the orientation of the DACM and eosin transition dipoles and the vector connecting the centers of the dyes, which will be discussed extensively below. However, we will first assess the integrity of the eosin-HEWL complex—an important test since no specific restraints were used to hold the eosin into the protein and because the eosin-HEWL complex is characterized by a modest Kd of 23μM (A. J. Huisman, L. R. Hartsell, B. P. Krueger, and M. J. Pikaart, unpublished). The distance between five different eosin atoms and nearby side chains in the protein were measured at each snapshot. These distances showed small fluctuations about their average values and never showed any large values that would be indicative of unbound eosin (data not shown). Thus, we can be confident that the eosin was bound to the HEWL throughout the full simulation.
Over the course of the ≈40ns of MD simulation, the system samples from a variety of structural families. With respect to the motions of the protein, a two-dimensional RMSD plot is useful for identifying the relationships between these different families. The all-atom (not including the fluorescent probes) two-dimensional RMSD plot (Fig. 3) shows the pairwise difference in structure of each snapshot to every other snapshot. (Due to limited resolution of the figure, snapshots every 200ps were used.) These data indicate that the protein is free to fluctuate between a number of different structures and that the final structures are not dramatically different from the early structures. In turn, these suggest that the system is reasonably well equilibrated at the outset and that the protein is not artificially inhibited by some artifact of the simulation, but is free to sample conformational space. Significant structural transitions are observed at 5.6ns, 10.2ns, 15.2ns, 28.7ns, and 30.3ns. A number of minor transitions between 15.2 and 28.7ns and between 30.3ns and the end could be identified as well, but as this work focuses on the behavior of the fluorescent probes, the detailed structural dynamics of the protein will not be discussed.
To identify whether the structural transitions identified through Fig. 3 are manifested in the behavior of the fluorescent probes, the trajectories of the D-A distance, RDA, and the three κ factors (κ, κy, κz) were examined (Fig. 4). These data were used, independent of Fig. 3, to identify transition times and different probe orientation families. Our goal was not to perfectly separate different statistical regions of κ and RDA behavior, but to roughly group similar regions to yield a modest number of families with identifiable characteristics. Transitions between families were determined based on visual inspection of the κ and RDA trajectories, using obvious shifts in the value of either parameter (e.g., at 8.22ns and 11.79ns) or spikes in either parameter accompanied by changes in the fluctuations of either parameter (e.g., 4.35ns and 25.77ns). Thus, the simulation was, rather arbitrarily, divided into eight time regions, which will be identified as probe orientation families A–H. The transition times and statistical characteristics of each family are given in Table 1,Table 2, and plots of the distributions of κ and RDA for some families are shown in Fig. 5. (Distributions for all families are given in Supplementary Material Figs. S3 and S4 .) It is clear from Table 1,Table 2 and Fig. 5 that there are a number of distinct families of κ and RDA behavior and that these distinctions are paralleled in the two quantities. For instance, the families with the three highest mean κ values, E, F, and G, exhibit three of the four highest RDA values. Family D shows the highest κ standard deviation as well as the highest RDA standard deviation, in contrast to family C, which shows the lowest standard deviations in both κ and RDA. Thus, it is clear that, at the family-to-family timescale, there is some degree of correlation between κ and RDA. This will be explored further after discussion of various calculations of RET rates.
| Table 1 Summary of the eight families of probe orientation behavior |
| RDA | κ | κ2 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Family | Time range (ns) | Unc | Mean (Å) | SD (Å) | Mean | SD | Corr | Mean | SD | Corr | ||
| A | 0–4.35 | 0.01 | 29.8 | 3.0 | −0.26 | 0.48 | 0.42 | 0.30 | 0.37 | −0.20 | ||
| B | 4.35–8.22 | 0.01 | 27.9 | 2.0 | −0.63 | 0.43 | 0.64 | 0.58 | 0.50 | −0.57 | ||
| C | 8.22–11.79 | 0.014 | 22.3 | 0.91 | −0.80 | 0.33 | 0.22 | 0.76 | 0.49 | −0.17 | ||
| D | 11.79–19.03 | 0.01 | 32.5 | 3.9 | 0.44 | 0.73 | 0.80 | 0.73 | 0.53 | 0.32 | ||
| E | 19.03–22.14 | 0.01 | 33.9 | 1.8 | 0.61 | 0.39 | 0.70 | 0.52 | 0.42 | 0.69 | ||
| F | 22.14–25.77 | 0.01 | 33.3 | 2.4 | 0.77 | 0.52 | 0.65 | 0.87 | 0.59 | 0.57 | ||
| G | 25.77–36.17 | 0.01 | 32.0 | 2.8 | 0.69 | 0.39 | 0.72 | 0.63 | 0.51 | 0.67 | ||
| H | 36.17–37 | 0.03 | 29.3 | 1.5 | −0.34 | 0.34 | 0.25 | 0.23 | 0.24 | −0.16 | ||
| All | 0–37 | 0.004 | 30.7 | 4.2 | 0.23 | 0.76 | 0.80 | 0.62 | 0.52 | 0.20 | ||
| Corr is Pearson's linear correlation coefficient (R) between RDA and the specified quantity. Unc is the maximum uncertainty in the correlation coefficient based on the value of the correlation and the population size. |
| Table 2 Summary of the eight families of probe orientation behavior for the alternate (y and z) acceptor transition dipoles |
| κy | ![]() | κz | ![]() | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Family | Unc | Mean | SD | Corr | Mean | SD | Corr | Mean | SD | Corr | Mean | SD | Corr | ||
| A | 0.01 | −0.80 | 0.30 | −0.57 | 0.74 | 0.45 | 0.59 | −0.01 | 0.48 | 0.58 | 0.23 | 0.20 | −0.24 | ||
| B | 0.01 | −0.40 | 0.38 | −0.56 | 0.30 | 0.30 | 0.59 | 0.56 | 0.26 | 0.11 | 0.38 | 0.26 | 0.09 | ||
| C | 0.014 | −0.62 | 0.17 | −0.06 | 0.41 | 0.20 | 0.07 | −0.36 | 0.22 | −0.27 | 0.18 | 0.15 | 0.23 | ||
| D | 0.01 | −1.06 | 0.37 | −0.76 | 1.26 | 0.68 | 0.77 | 0.49 | 0.32 | 0.42 | 0.35 | 0.28 | 0.27 | ||
| E | 0.01 | −0.99 | 0.36 | −0.66 | 1.11 | 0.70 | 0.70 | −0.31 | 0.35 | 0.60 | 0.22 | 0.22 | −0.53 | ||
| F | 0.012 | −1.18 | 0.23 | −0.41 | 1.44 | 0.50 | 0.40 | 0.41 | 0.38 | 0.44 | 0.31 | 0.28 | 0.36 | ||
| G | 0.01 | −1.11 | 0.25 | −0.29 | 1.29 | 0.49 | 0.34 | −0.05 | 0.34 | 0.44 | 0.12 | 0.17 | 0.02 | ||
| H | 0.03 | −0.78 | 0.25 | −0.13 | 0.67 | 0.36 | 0.15 | 0.41 | 0.28 | 0.24 | 0.25 | 0.22 | 0.19 | ||
| All | 0.004 | −0.93 | 0.39 | −0.64 | 1.02 | 0.64 | 0.68 | 0.13 | 0.47 | 0.34 | 0.24 | 0.24 | 0.07 | ||
| Corr is Pearson's linear correlation coefficient (R) between RDA and the specified quantity. Unc is the maximum uncertainty in the correlation coefficient based on the value of the correlation and the population size. |
The electronic interactions that promote RET between the eosin and DACM can be evaluated by examining just the two structural parameters, κ and RDA. Provided the ideal dipole approximation (IDA) holds, use of the simple expression given in Eq. (2) is justified. Given that the two dyes are reasonably far apart in this system (>20Å), we expect the IDA to be valid. In cases where the IDA is suspect, several groups have developed methods for accurately evaluating the D-A interactions 80,81,82,83. While these methods are computationally more demanding than simply evaluating Eq. (2), their implementation is straightforward and practical application in conjunction with MD simulations has been demonstrated 23,42. For systems in which one dye is fixed and the other is free, in principle, to sample all space, 〈κ2〉 depends on the angle between the fixed transition dipole (A in this case) and the vector connecting the centers of the probes such that 5,28
![]() | (5) |
and
(taken from the MD simulations) are ∼39°, 124°, and 75°, which give theoretical
values of 0.93, 0.64, and 0.40, respectively, for the actual eosin transition, y, and z. The average values from the simulation are found to be 
and
The discrepancies between theoretical and observed values for all three of the 〈κ2〉 suggest that the DACM motion is significantly restricted. Examination of the distribution of DACM transition dipole orientations after removing protein motion (Supplementary Material Fig. S5 ) suggests that it explores a fairly limited range of space. However, this visual inspection can be deceiving since the DACM need explore only one hemisphere of space to yield a 〈κ2〉 that agrees with
More quantitative analysis shows that 90.1% of the observed DACM orientations lie within 1.46π steradians or 36.6% of all possible orientations. For comparison, the tightly bound eosin has 99.6% of its distribution concentrated within 0.0744π steradians or 1.86% of all possible orientations (also shown in Supplementary Material Fig. S5 ). Thus, while the DACM is quite free to move relative to eosin, it does clearly explore a restricted portion of orientation space, supporting the observed deviation between
and 〈κ2〉.It follows that replacing κ2 with its theoretical average in Eq. (2) will lead to a poor estimate of the rate. Indeed, combining the theoretical
value with the appropriate average distance factor,
yields average RET rates of 
and
whereas using the 〈κ2〉 value from the MD trajectory in Eq. (2), and assuming Eq. (4) is valid, yields the independently averaged rate,
or 2.03×10−9C and 0.48×10−9C for y and z, respectively. Note that C is the set of constants that describe the spectral properties of DACM and eosin and is implied to contain the units of the rate (see Eqs. (1)). (Because approximations relevant to these constants are not addressed here, all rates will be evaluated simply in terms of the dynamic structural parameters κ and RDA and left in terms of C.) If, however, we do not assume Eq. (4) is valid—that is, we do not assume that κ and RDA are independent—but instead evaluate Eq. (2) at each snapshot and then determine the average rate, we arrive at 
and
Note that for the y transition dipole, the properly averaged rate is a factor of 1.6 smaller than when κ and RDA are assumed to be independent and that for the z transition dipole, kRETz is a factor of 1.9 smaller than assuming the theoretical average value for κ2. These results are summarized in Table 3.
| Table 3 Summary of the different methods of computing the RET rate |
| Rate (× 10−9C)* | ||||||
|---|---|---|---|---|---|---|
| Model | Eosin | y | z | Assumptions | ||
![]() | 1.86 | 1.28 | 0.81 | ; κ and RDA independent; others† | ||
![]() | 1.24 | 2.03 | 0.48 | κ and RDA independent; others† | ||
![]() | 1.28 | 1.27 | 0.43 | Others† | ||
| Model gives the expression used to compute kRET. Rate gives the calculated value of kRET for each of the three A transitions studied. Assumptions lists the assumptions that are appropriate for the given model. |
| * C is the set of constants that describe the spectral properties of the D and A and is implied to contain the units of the rate; see Eqs. (1). † Others means all other approximations required for Förster theory, such as the ideal dipole approximation and that C is constant with respect to structural fluctuations. |
The fact that
and
yield such different values in the y case suggests that there is significant correlation between κ and RDA. Fig. 6 shows a scatter plot of RDA versus κ, which confirms that the two variables do depend on each other. The high extent of correlation between κ and RDA is clearly shown in Fig. 7, an overlay of the κ and RDA trajectories. Using linear regression of RDA versus κ to quantify this correlation yields a correlation coefficient of 0.80±0.001 for the eosin transition, −0.64±0.002 for the y variant, and 0.34±0.004 for z—all clearly nonzero. Of course, it is the correlation between κ2 and RDA that is important to RET. These correlation coefficients are 0.20±0.004, 0.68±0.002, and 0.07±0.004 for the actual, y, and z transition dipoles, respectively.
The contrast in κ – R and κ2 – R behavior for the three eosin transitions (actual, y, and z) is interesting. For the actual eosin transition, there is a high correlation between κ and RDA and a much smaller (though still significant) correlation between κ2 and RDA. Despite the correlation coefficient of 0.20, the
and
measures of kRET differ by <3%. Examination of Table 1 reveals that half of the eight families (the ones with 〈κ〉>0) exhibit positive correlation between κ2 and RDA, and half negative correlation. Thus, good agreement between
and kRET is due to fortuitous cancellation of error over multi-nanosecond timescales (comparable to the D excited-state lifetime) because the value of κ fluctuates roughly equally about zero. For the y variant, the correlation between
and RDA is higher than that between κy and RDA (Table 2). In this case, the particular orientation of the A transition dipole is such that the
correlation is always positive and mainly significant, larger than 0.3 for six of the eight families, leading to the large discrepancy between kRET and
Finally, in the z variant, the correlation between
and RDA is generally small, <0.3 for six of the eight families leading to good overall agreement between kRET and
.
Table 1,Table 2 show that some of the probe orientation families exhibit high κ – RDA correlation (absolute value >0.3) for all three A transition dipoles (families A, D, E, and F) and some exhibit consistently low correlation (families C and H). Looking at family C more carefully reveals that its low correlation is the consequence of an instability in the relationship between κ and RDA. Breaking family C into portions 1000-points (200ps) long reveals correlation coefficients that range from −0.46 to 0.76. Thus, it is not that κ and RDA are uncorrelated throughout family C; instead, there is a relationship between κ and RDA, but that relationship is unstable and the positive and negative correlations cancel each other. (Because of its relatively small number of points, family H was not analyzed in this way.) Thus, if one looks at small enough timescales, timescales that are comparable to energy transfer times in many cases, the correlation between κ and RDA is found to be significant at all points throughout the simulation. Note that the instability observed in family C is likely the result of DACM being bound to the surface of the protein (Fig. 1). This causes the fluctuations in both κ and RDA to be small in magnitude such that their relationship can easily change from positively correlated to negatively correlated. Thus, ironically, the long-term correlation between κ and RDA is smallest in this particular case when both probes are essentially fixed to the protein.
The fact that κ – RDA correlation in family C disappears as one averages over longer and longer timescales suggests that the overall correlation observed here might become less significant if the MD simulation were extended to μs or ms timescales. While this is certainly possible, it is important to note that the majority of probe orientation families found here exhibit significant κ – RDA correlation. Thus, for this system, correlation between κ and RDA appears to be typical; lack of correlation is the exception. It follows that any new probe orientation families that might be sampled during a longer MD trajectory are likely to also exhibit κ – RDA correlation. As pointed out earlier and shown in Fig. 7, there is significant correlation over both short and long (family-to-family) timescales. The existence of multi-nanosecond correlation is supported by the fact that the overall κ – RDA correlation is larger than any of the individual family κ – RDA correlations. Thus, while we cannot guarantee that the 40ns MD simulation has fully sampled representative dynamics of the D-lysozyme-A system, the observation of correlation between κ and RDA (Fig. 7) throughout a variety of structural families (Fig. 3) suggests that extending the MD simulation to longer timescales may actually result in an increase in the observed overall κ – RDA correlation. Indeed, the κ – RDA correlation coefficient for the first 20ns of the simulation is 0.79, for the last 20ns is 0.66, and overall is 0.80.
It is also important to note that there are two distributions with which the D and A sampling of space are important: the distribution of structures sampled over the same timescale as energy transfer, and the total distribution of structures. Even if the full set of relevant system structures do not exhibit correlation between κ and RDA, it is clear that the κ – RDA correlation is significant over timescales relevant to energy transfer.
What is it that drives this correlation? It appears to be a result of the stiffness of the succinimide group that links the cysteine side chain with the DACM probe, and which results from the commonly-used maleimide conjugation chemistry. There is little flexibility in the connection between the coumarin skeleton of the dye and the five-membered ring of the succinimide, nor is the succinimide group itself very flexible. Thus, the motions of the dye are expected to be mainly due to flexibility about the β-carbon and sulfur of the cysteine side chain. Histograms of the motions of the five dihedral angles (defined in Fig. 2) are shown in Fig. 8. Dihedrals 1 and 2 each display a single peak, at 180° and 120°, respectively. Dihedral 4 displays more flexibility, but still samples a single peak for the majority of snapshots (180° ≈70% of the time). Dihedrals 3 and 5 show the most flexibility, with #3 sampling mainly two angles (60° and 180° totaling ≈90% of the snapshots) and #5 sampling strongly from three angles (≈50% at –60°, and ≈20% at 60° and 180°). The result of the relative inflexibility of the dye-protein linkage is that the dye behaves as though it is at the end of a long lever arm and, therefore, both its angular orientation with respect to the protein and its position are dependent on the conformation of the cysteine side chain.
It is likely that this correlation in angular orientation and spatial position with respect to the protein exists for most fluorescent molecules that are conjugated through maleimide chemistry. In the system of study here, the other dye (eosin) is essentially fixed relative to the protein so the correlation becomes apparent. In the more typical case where both probes are free to move, the relative motions of the two dyes are likely complex enough that the correlation becomes obscured in many cases.
While the results observed here are likely to be relevant to a wide variety of systems, the reader should bear in mind that the particular questions being addressed will determine their importance. As has been pointed out by many authors, the practical use of FRET as a structural probe is greatly aided by the
in Eq. (2). That large dependence of kRET on RDA translates to a small dependence of RDA on kRET. Thus, the 60% error in kRETy observed here when ignoring the correlation between κy and RDA would translate into ∼10% error in determining RDA from an experimental value of kRET. In many cases this amount error is significant, in many cases it is not. However, in all cases the validity of assuming that κ and RDA are independent should be taken into consideration, along with the other FRET approximations. Finally, we note that the dynamic structural information determined here through MD simulations allows models for time-resolved emission decays of the D and A to be constructed 47. These models would allow direct comparison of the structural dynamics modeled by the simulation with experimentally measurable observables. Further analysis along these lines will be presented elsewhere.
MD simulations have been used to examine the structural fluctuations of an archetypal protein, HEWL, labeled in typical fashion with a donor dye, DACM, and noncovalently binding an acceptor dye, eosin, into a fixed pocket. Because of the structure of the linkage that results from the maleimide conjugation chemistry, the angular orientation and the spatial position of the DACM are related. This results in a significant degree of correlation between κ and RDA, which yields a factor of 1.6 error in estimating kRET when one assumes κ and RDA are independent. The presence of correlation between κ and RDA would lead to a noticeable (∼10%) error in calculating RDA from an experimental kRET in this system. This correlation also highlights the fact that the standard treatment of FRET data assumes, among other things, the independence of κ and RDA, though this has been rarely discussed in the literature. In addition, the 〈κ2〉 value is quite different than the theoretical value for one free probe and one fixed probe. As the methods applied to study FRET, the systems studied with FRET, and the questions addressed by FRET all become broader and more complex, recognition of the κ and RDA independence approximation, along with other FRET approximations, becomes increasingly more important.
The authors thank Katie L. Hinkle for assistance with statistical analysis as well as Jacquelyn D. Lewis and Nora E. Kuiper for examination of the steady-state anisotropy behavior of DACM and eosin.
This work was generously supported by grants from Research Corporation, the American Chemical Society Petroleum Research Fund, the Howard Hughes Medical Institute, and the National Science Foundation Major Research Instrumentation and Research Experience for Undergraduates programs. Computer simulations were performed at the National Center for Supercomputing Applications. D.B.V. also recognizes the Beckman Foundation and Merck & Co. for their support.
An online supplement to this article can be found by visiting BJ Online at http://www.biophysj.org.
1. (1996). Fluorescence resonance energy transfer. In Wang, X.F., Herman, B., eds. Fluorescence Imaging and Microscopy. Chemical Analysis Vol. 137, (New York: Wiley Interscience). PubMed
2. (1999). Resonance Energy Transfer. (New York: John Wiley & Sons). PubMed
3. (1967). Energy transfer: a spectroscopic ruler. Proc. Natl. Acad. Sci. USA 58, 719–726. CrossRef | PubMed
4. (1978). Fluorescence energy transfer as a spectroscopic ruler. Annu. Rev. Biochem. 47, 819–846. PubMed
5. (1974). Intramolecular distances determined by energy transfer. Dependence on orientational freedom of donor and acceptor. Biopolymers 13, 1573–1605. CrossRef | PubMed
6. (1991). Picosecond fluorescence of simple photosynthetic membranes: evidence of spectral inhomogeneity and directed energy transfer. Chem. Phys. 149, 409–418. PubMed
7. (1992). Distance distribution in a dye-linked oligonucleotide determined by time-resolved fluorescence energy transfer. Biophys. Chem. 45, 133–141. CrossRef | PubMed
8. (1996). Excitation energy transfer in carotenoid-chlorophyll protein complexes probed by femtosecond fluorescence decays. Chem. Phys. Lett. 260, 147–152. PubMed
9. (1997). Fluorescence resonance energy transfer as a probe of DNA structure and function. Methods Enzymol. 278, 417–444. CrossRef | PubMed
10. (1999). Folding dynamics of single GCN-4 peptides by fluorescence resonant energy transfer confocal microscopy. Chem. Phys. 247, 69–83. PubMed
11. (1999). Depolymerization of phospholamban in the presence of calcium pump: a fluorescence energy transfer study. Biochemistry 38, 3954–3962. PubMed
12. (2002). Time-resolved fluorescence resonance energy transfer: a versatile tool for the analysis of nucleic acids. Biopolymers 67, 159–179. PubMed
13. (2005). Protein structure and dynamics from single-molecule fluorescence resonance energy transfer. J. Phys. Chem. B 109, 1626–1634. PubMed
14. (1995). Excitation transfer in the core light-harvesting complex (lh-1) of Rhodobacter sphaeroides: an ultrafast fluorescence depolarization and annihilation study. J. Phys. Chem. 99, 16179–16191. CrossRef | PubMed
15. (2000). A single-molecule study of RNA catalysis and folding. Science 288, 2048–2051. CrossRef | PubMed
16. (1968). Fluorescence spectroscopy of proteins. Science 162, 526–533. PubMed
17. (2005). Imaging protein molecules using FRET and FLIM microscopy. Curr. Opin. Biotechnol. 16, 19–27. CrossRef | PubMed
18. (2005). Polyproline and the “spectroscopic ruler” revisited with single-molecule fluorescence. Proc. Natl. Acad. Sci. USA 102, 2754–2759. CrossRef | PubMed
19. (2006). Fluorescence probes of protein dynamics and conformations in freely diffusing molecules: single-molecule resonance energy transfer and time-resolved fluorescence methods. In Geddes, C.D., Lakowicz, J.R., eds. Reviews in Fluorescence Vol. 3, (New York: Springer). PubMed
20. (1978). Intramolecular energy transfer in molecules with a large number of conformations. Proc. Natl. Acad. Sci. USA 75,