Article Outline

Article Information

PubMed

Related Articles

  • …more

Copyright © 2006 The Biophysical Society. All rights reserved.
Biophysical Journal, Volume 91, Issue 11, L87-L89, 1 December 2006

doi:10.1529/biophysj.106.090837

Biophysical Letters

Ab Initio Self-Consistent X-Ray Absorption Fine Structure Analysis for Metalloproteins

Nicholas Dimakis*Go To Corresponding Author  and Grant Bunker

* University of Texas-Pan American, Edinburg, Texas
Department of Biological Chemical and Physical Sciences, Illinois Institute of Technology, Chicago, Illinois

Address reprint requests and inquiries to Nicholas Dimakis, Tel.: 956-380-8761; Fax: 956-381-2423.

Abstract

X-ray absorption fine structure is a powerful tool for probing the structures of metals in proteins in both crystalline and noncrystalline environments. Until recently, a fundamental problem in biological XAFS has been that ad hoc assumptions must be made concerning the vibrational properties of the amino acid residues that are coordinated to the metal to fit the data. Here, an automatic procedure for accurate structural determination of active sites of metalloproteins is presented. It is based on direct multiple-scattering simulation of experimental X-ray absorption fine structure spectra combining electron multiple scattering calculations with density functional theory calculations of vibrational modes of amino acid residues and the genetic algorithm differential evolution to determine a global minimum in the space of fitting parameters. Structure determination of the metalloprotein active site is obtained through a self-consistent iterative procedure with only minimal initial information.

Metalloproteins constitute ∼30% of all total known proteins in nature 1. Their active sites typically consist of a metal ion center that is coordinated to various amino acid residues. Structural information of metalloproteins is probed by x-ray crystallography, x-ray absorption fine structure (XAFS) 2,3, and a variety of spectroscopic techniques. Although x-ray crystallography is the primary probe for protein structure, its utility in providing accurate metalloprotein structural information is limited to availability and quality of crystalline samples. XAFS provides complementary local structural information about a selected metal ion for molecules in solution as well as crystals and polycrystalline samples 4. Obtaining structural information for Zn sites from conventional spectroscopic methods is difficult or impossible.

Significant advances have been made in recent years in XAFS theory and experimental technology; however, biological XAFS has not achieved its full potential due to limitations on data analysis and modeling of the experimental XAFS spectra. Structural information is usually obtained by XAFS when experimental spectra are fitted by a theoretical model that describes a hypothetical structure by a set of adjustable parameters (e.g., the edge-energy shift ΔE0, the amplitude reduction factor , the absorber- ith scatterer distances Ri, and the mean-square variations of electron single- (SS) and multiple- (MS) half-scattering paths 5. Clark-Baldwin et al. 6 had described the difficulty of correctly modeling Zn active sites due to the high parameter correlation during XAFS data analysis (e.g., a ZnCys3O site has been incorrectly modeled with tetrahedral ZnS47, and ZnCys4 was incorrectly modeled as ZnS3O 8) even in highly symmetrical conformations when MS is minimal. This is due to the presence of multiple solutions that are consistent with the experimental XAFS spectra within the uncertainties. This situation substantially worsens when MS dominates, which usually occurs when the x-ray absorbing metal atom is almost collinear with two other scatterers (e.g., metal-histidine complexes). In this case, the number of parameters significantly exceeds the limited information content in the XAFS data 9, and additional information or assumptions must be provided.

An attempt to reduce the number of MS fitting parameters on active site of metalloproteins was reported by Dimakis and Bunker 10 by expressing the SS and MS as a direct function of the first shell metal-amino acid residue distance, and the sample temperature thus virtually eliminating these parameters from the fitting procedure. This has been achieved by calculating the phonon normal mode properties on a series of small metal-amino acid model compounds at various metal-first shell scattering atom distances using density functional theory (DFT) 11. However, parameter reduction alone is not sufficient to avoid obtaining an incorrect structure via XAFS especially for cases of unknown metalloproteins where the information about the actual identity of the amino acids and their apparent distances from the known central absorbing metal is limited. We have developed an automatic iterative technique where a series of XAFS χ(k) spectra from hypothetical structural conformations is directly fitted to the experimental XAFS spectra using 1), the global fitting genetic algorithm “differential evolution” (DE) 12 to minimize the mean-square error,

(1)
between χ(k) XAFS spectra of hypothetical and experimental spectra, and concurrently 2), eliminate the numerous SS and MS factor parameters from the fitting procedure (Fig. 1, left). The process is fully ab initio and only depends on the choice of the absorbing metal.

Display large version of this figure
Figure 1
(Left) Fitting procedure. (A). DE population consists of randomly generated DE parameter vectors {Ri, ΔE0, , }. (B) A new DE vector is generated from three other DE vectors of the initial population. (C) A trial DE vector is produced through a crossover operation. (D) 1), This DE vector is sent to the Tool Command Language (TCL) script and a hypothetical structure is generated; 2), χ(k) FEFF8- 13 generated spectra are produced and their scattering paths information is sent back to the TCL script; 3), are calculated by the TCL script for all-important SS/MS paths; and 4), χ(k) with DWF information are generated. (E) The corresponding error fit is calculated. The process 1–4 is repeated for the target vector. If the error fit e2 from the trial vector is less than the corresponding error fit from the target vector, the trial vector replaces the target vector in the DE population. (Right) Optimal DE four shell fits for a), ZnCys4 and b), Zn-Cys3His tetrahedral geometries at 40K. Fitting is performed over actual unfiltered experimental χ(k) k3 XAFS spectra with noise at k-range 0–12.5Å−1 for a), 60 and b), 110 scattering paths, respectively.

The method is tested using two Zn metalloprotein active sites: the homogeneous ZnCys4 and the heterogeneous ZnCys3His ligation, either modeled as tetrahedral compounds. Structural information for these samples had been obtained by Clark-Baldwin et al. 6 using traditional least-squares approximation where 1), the number of available fit parameters was restricted (ΔE0=9eV, =1.02, =0.85), during fitting to avoid wrong identification of the ligation, and 2), MS was ignored. To prove the validity of our technique and to avoid any bias over the selection of the initial parameter guess, the DE vector parameters have been chosen within the following wide range intervals: RCys=(2.1–2.6) Å, RHis=(1.8–2.5) Å, ΔE0=±20.eV, =0.5–1.5, and Å2, where accounting for static σ2 and miscalculations on the . Each metal-ligand distance has been treated as an independent parameter. Although there has been no clear argument on how the DE process formally terminates, one can assume that this successfully occurs when the parameters of the final DE population are within an acceptable small range. In our case, we have set the following criteria for the final DE population: ΔE0=±1.5eV, =±0.05, average ligand distance R=±0.002Å, whereas no criteria apply to the Δσ2 parameter.

Allowing parameter variation within the above intervals, the optimal DE vector fits for either sample has been obtained after 316 and 234 steps, respectively; the process may terminate earlier by selecting smaller parameter variation ranges (Fig. 1, right). These automatic high quality fits provide e2 values of 0.655 and 0.670 for the homogenous and heterogeneous structures, respectively, which are higher than the corresponding values reported for the same compounds under conventional least-squares fitting (Table 1). This is due to 1), noise included in the DE fitting procedure (XAFS χ(k) spectra are unfiltered) and 2), increasing number of parameters on a two-shell refinement reduces the e2 value in the least-squares fitting algorithm. An improvement in the error e2 does not necessarily lead to improved data description: e2 may systematically be reduced by performing a Fourier filtering on the XAFS χ(k) spectra. Average radial first shell distances obtained from the DE algorithm are in agreement with previous XAFS calculations and XRD. Analysis of the optimal DE population reveals the reliability of the structural parameters obtained with our method; focusing on the region of zero Δσ2, the average first shell radial distance is <5/1000Å (Fig. 2). Additionally, these vectors possess e2 values within <% with respect to the value of the DE optimal vector.

Our method is general: it can be applied to any metalloprotein or hemoprotein as long as the SS and MS values have been accurately expressed as a function of the first shell distance and sample temperature.

The code will be available upon request from the authors.

Acknowledgment

We thank Prof. James Penner-Hahn for providing us the XAFS experimental spectra.

References and Footnotes

1. Quevillon-Cheruel, S., Collinet, B., Zhou, C.-Z., Minard, P., Blondeau, K., Henkes, G., Aufrère, R., Coutant, J., Guittet, E., Lewit-Bentley, A., et al. (2003). A structural genomics initiative on yeast proteins. J. Synchrotron Rad. 10, 4–8. PubMed

2. Stern, E.A. (1988). Theory of EXAFS. In X-ray Absorption. Koningsberger, D.C., Prins, R., eds. (New York: Wiley), Chap. 1. PubMed

3. Lee, P.A., and Pendry, J.B. (1975). Theory of the extended x-ray absorption fine structure. Phys. Rev. B11, 2795–2811. PubMed

4. Hasnain, S.S., and Hodgson, K.O. (1999). Structure of metal centres in proteins at subatomic resolution. J. Synchrotron Rad. 6, 852–864. PubMed

5. The parameters that are a direct function of the normal mode properties of the sample and its temperature are incorporated into the XAFS χ(k) amplitude as exponential decay of the form called Debye-Waller factor (DWF), being the photoelectron momentum.

6. Clark-Baldwin, K., Tierney, D.L., Govindaswamy, N., Gruff, E.S., Kim, C., Berg, J., Koch, S.A., and Penner-Hahn, J.E. (1998). The limitations of x-ray absorption spectroscopy for determining the structure of zinc sites in proteins. When is a tetrathiolate not a tetrathiolate?. J. Am. Chem. Soc. 120, 8401–8409. CrossRef | PubMed

7. Dent, A.J., Beyersmann, D., Block, C., and Hasnain, S.S. (1990). Two different zinc sites in bovine 5-aminolevulinate dehydratase distinguished by extended x-ray absorption fine structure. Biochemistry 29, 7822–7828. PubMed

8. Povey, J.F., Diakum, G.P., Garner, G.D., Wilson, S.P., and Laue, E.D. (1990). Metal ion co-ordination in the DNA binding domain of the yeast transcriptional activator GAL4. FEBS Lett. 266, 142–146. CrossRef | PubMed

9. Stern, E.A. (1993). Number of relevant independent points in x-ray-absorption fine-structure spectra. Phys. Rev. B48, 9825–9827. PubMed

10. Dimakis, N., and Bunker, G. (2004). XAFS Debye-Waller factors for Zn metalloproteins. Phys. Rev. B70, , 195114(1)–195114(12). PubMed

11. Hohenberg, P., and Kohn, W. (1964). Inhomogeneous electron gas. Phys. Rev. 136, B864–B871. PubMed

12. Storn, R., and Price, K. (1997). Differential evolution—a simple and efficient heuristic for global optimization over continuous spaces. J. Glob. Optim. 11, 341–359. PubMed

13. Ankudinov, A.L., Ravel, B., Rehr, J.J., and Conradson, S.D. (1998). Real-space multiple-scattering calculation and interpretation of x-ray-absorption near-edge structure. Phys. Rev. B58, 7565–7576. PubMed

14. Simonson, T., and Calimet, N. (2002). CysxHisy-Zn2+ interactions: thiol vs. thiolate coordination. Proteins 49, 37–48. CrossRef | PubMed

Publication Information


Received: June 8, 2006
Accepted: September 11, 2006