| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Max Planck Institute for Dynamics of Complex Technical Systems, Magdeburg, Germany
Correspondence: Address reprint requests to M. Ederer, E-mail: ederer{at}mpi-magdeburg.mpg.de.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Microscopic reversibility restricts the set of physically possible parameter values of thermodynamic systems. For systems near equilibrium, this was discussed in 1931 by Onsager (4
,5
). Near equilibrium, the constraints have the form of symmetry conditions on phenomenological parameters of the linearized equations. Constraints on stationary, far-from-equilibrium flux distributions of reaction networks were recently discussed in the literature (6
12
). The constraints can be written as sign conditions on stationary fluxes. These publications also contain good introductions to the treatment of nonequilibrium reaction systems in thermodynamics. A review concerning detailed balance in dynamic, kinetic modeling can be found in Heinrich and Schuster (13
). In the literature (14
,15
) detailed balancing is discussed in the framework of a formal, mathematical representation of general kinetic systems. Recently, the problem of how to impose detailed balance in complex reaction mechanisms was discussed in two articles (16
,17
). The former proposes a method based on explicit identification of stoichiometric cycles. The latter relies on the framework of generalized kinetic systems (14
) and presents a parameterization of the model in terms of equilibrium concentrations and fluxes.
Here, we attack the problem of imposing detailed balance in kinetic models from a thermodynamic point of view. We confirm and extend the results of Yang et al. (17
) and give an intuitive thermodynamic interpretation.
As the following example shows, kinetic models containing true cycles may easily describe systems that violate microscopic reversibility, if no special care is taken. We consider a reaction network describing the random-order complexation of three compounds A, B, and C:
![]() | (1) |
Because the order of complexation is arbitrary, the system contains a cycle:
![]() | (2) |
Assuming a homogeneous phase of constant volume, the mole balances read
![]() | (3) |
Here ci [mol m3] and Jj [mol m3 s1] denote concentrations and fluxes, respectively. We assume that the reactions obey a mass-action law with kinetic parameters k±j:
![]() | (4) |
The second law and detailed balance demand that thermodynamic equilibrium with J1,eq = J2,eq = J3,eq = J4,eq = 0 exists. With ci,eq denoting the equilibrium concentrations this leads to the condition that the product of the equilibrium constants along the cycle has to be one:
![]() | (5) |
The above equation for the parameters is called detailed balance relation or Wegscheider condition. For all other parameter combinations the model describes a physically impossible system with a non-zero steady-state flux and thus a permanent deviation of the concentrations from the equilibrium ratios.
The deviation could be used by an attached system to perform work; i.e., energy-rich products could be formed from energy-poor precursors without consumption of energy-rich substrates. Thus the model appears as a chemical perpetuum-mobile, which can produce chemical energy. In this sense, a model violating the Wegscheider conditions may be interpreted as a system violating energy conservation (18
).
The argumentation above is based on the existence of thermodynamic equilibrium, but is not limited to systems that actually reach thermodynamic equilibrium. The crucial point is, that a system will reach thermodynamic equilibrium, if it is isolated from its surroundings (all boundary fluxes zero). In biological terms this means that the system will die, if feeding stops.
Determination of detailed balance relations requires the consideration of the complete stoichiometry. Many models use a simplified stoichiometry, where the balance equations of certain compounds are omitted and their concentrations are assumed to be constant. This is done often with the ubiquitous metabolites adenosine triphosphate (ATP), adenosine diphosphate (ADP), and inorganic phosphate (P). Their concentrations are assumed to be tightly regulated and usually only a small part of all reactions transforming ATP, ADP, and P are modeled. The simplified stoichiometry cannot be used for deriving detailed balance relations. We will illustrate this by means of our example network (Eq. 1). Assume that the stoichiometry of reaction 1 was not complete, but that it includes the cleavage of energy-rich ATP into its energy-poor constituents ADP and P. The true stoichiometry of reaction 1 is
. Because the concentrations of ATP, ADP, and P are assumed to be constant, the form of the rate laws in Eq. 4 needs not to be changed, if k±1 values are allowed to depend on the concentrations of ATP, ADP, and P. By clamping these concentrations, an external thermodynamic force is imposed upon the system, preventing it from going to thermodynamic equilibrium, and driving a cyclic flux. Such cycles are called futile cycles, since they are driven by a permanent inflow of energy-rich (ATP) and outflow of energy-poor (ADP, P) compounds. In steady state the energy dissipated by the cycle is equal to the amount of work needed to keep ATP, ADP, and P concentrations constant by rephosphorylating the produced ADP. The argument used to derive the detailed balance relations in Eq. 5 of the original network is not valid here, because it is based on the assumption that thermodynamic equilibrium is reached. This shows that detailed balance relations can only be formulated for true cycles but not for futile cycles. When working with a simplified stoichiometry one cannot derive detailed balance relations, since it is impossible to decide if a certain cycle is a true cycle or a futile cycle. For this reason, we will assume that in the following the considered network stoichiometries are complete.
Whenever a system contains true cycles and its parameters are not known completely and accurately, detailed balance relations have implications on modeling and model analysis. Their explicit consideration can simplify parameter estimation from experimental data, because they lower the number of independent parameters. However, they put constraints on the usual praxis of adopting parameters from other models of similar systems. This can be illustrated with a series of models of epidermal growth factor (EGF) signaling in mammalian cells. During EGF signal transduction phosphorylated Shc, Grb2, and Sos form a complex as A, B, and C in the above example. This subnetwork is embedded in a larger network that contains many phosphorylations and dephosphorylations (especially of Shc and Sos). However, the stoichiometry of the complexation cycle is complete, because none of the reactions requires the cleavage of ATP. In Kholodenko et al. (19
), a model of EGF signaling is presented that explicitly acknowledges detailed balance relations and adjusts the parameter values accordingly. The model presented in Schoeberl et al. (20
) adopts some of the parameters from Kholodenko et al. (19
) but changes others without observance of detailed balance. Finally Liu et al. (21
) analyzes the model of (20
) but again changes a few parameters without consideration of detailed balance. Consequently the latter two models are thermodynamically infeasible and a cyclic flux can occur. A more detailed analysis would be necessary to check whether this inconsistency has a major effect on the model behavior or if its effect is negligible.
In Schoeberl et al. (20
) as well as Liu et al. (21
) a sensitivity analysis was performed, i.e., changes in system behavior induced by small changes in each single parameter are studied. However, because the parameters are not independent from each other, sensitivity analysis may yield misleading results, because impossible parameter variations are also considered. A more detailed analysis would also be necessary here, to judge the consequences.
A problematic aspect of the model in Eqs. 3 and 4 is that it does not formally consider the driving forces of the reactions. Model equations for electrical and mechanical systems explicitly contain voltages and forces, which are physical quantities driving electrical current and momentum changes, respectively; that is, fluxes and conjugated forces point in the same direction. In the case that no external forces are imposed to the system this guarantees the existence of an equilibrium state where all forces and fluxes are zero. Thus the fulfillment of the second law and of detailed balance is ensured by the model structure and it is not necessary to observe special detailed balance equations for the parameters. In this view model equations that violate detailed balance are comparable to the picture "Waterfall" by the graphic artist M.C. Escher. It shows a cyclic self-sustained water flow driving a water wheel. This is physically impossible, because water always flows from higher to lower levels; i.e., absolute water levels may serve as potentials for water flow. Graphical representation of a cyclic water flow is possible, because the mapping of a point in the drawing plane to a point in space is not unique and so it is possible to interpret the drawn water surfaces differently. The beholder chooses one of the possible interpretations of a certain surface depending on its local context in the picture. The illusion is based on the fact that every small part of the picture shows a realistic situation with a water flow from higher to lower potentials. Nevertheless as a whole it shows an impossible situation, violating energy conservation.
To structurally avoid such situations, models have to explicitly contain the potentials and driving forces. But as we will see, the quantities usually considered as potentials and driving forces of a reaction systemchemical potentials and negative Gibbs reaction energiesare quite unsuited for kinetic modeling far from equilibrium. Here, we suggest an equivalent alternative driving force that simplifies the model equations.
A holistic understanding of large biochemical networks requires models that describe the functioning of the system under the constraints of mass and energy conservation. Both classes of constraints are determined by the complete stoichiometry of the network. Kinetic modeling structurally accounts for the constraints of mass conservation in the mole balance equations, but can violate energy conservation, if the detailed balance relations are not fulfilled. This situation is unsatisfactory. In the following we will introduce a formalism that is similar to traditional kinetic modeling but structurally forbids models that violate energy conservation. The formalism is useful for modeling large reaction networks with many interconnected stoichiometric cycles. We suggest the term "thermodynamic-kinetic modeling" (TKM) for the proposed formalism.
| METHODS |
|---|
|
|
|---|
![]() | (6) |
Rn, J
Rm, and k
Rl are the vectors of concentrations, fluxes, and parameters, respectively. N
Rn x m is the stoichiometric matrix. The kinetic properties of the reactions are modeled by the dependency of fluxes J on concentrations c and parameters k. The detailed balance relations that hold in such a system can be written as BT log(Keq) = 0, where log(Keq)
Rm denotes the vector of the logarithms of the equilibrium constants. The columns of the matrix
span the dm-dimensional right null space of N, i.e., N B = 0. Thus there exist dm = m rank(N) independent detailed balance conditions that restrict the possible values of the equilibrium constants Keq,j of the reactions. These relations are referred to as the "generalized Wegscheider condition". For their derivation, see Heinrich and Schuster (13
Assumptions
In the following we assume that the kinetic rate equations can be described by ideal mass-action rate laws with constant coefficients as in Eq. 4. Later we will relax this assumption and discuss systems of generalized mass-action rate laws. The use of mass-action kinetics implies two thermodynamic assumptions that are discussed in the following:
[mol m3]. The molar density is the sum of the concentrations of all species in a phase. It is approximatively constant if we assume a high fraction of inert components. In this case, the sum of the concentrations of the reactive species ci is much smaller than the molar density:
. The main component of biological reaction systems is usually water. It participates in many biochemical reactions, but their rates are low compared to water concentration. For this reason, the condition c
const is usually fulfilled in biological reaction networks.
Kinetic and thermodynamics of a reaction
We consider a reaction with educts E, products P, and stoichiometric coefficients
i:
![]() | (7) |
The reaction rate follows a mass-action law
![]() | (8) |
[S] [J K1 m3 s1] of a reaction is given by T
[S] =
µ J, where J [mol m3 s1] is the reaction flux, T [K] the thermodynamic temperature and
µ [J mol1] is the negative Gibbs reaction energy
![]() | (9) |
µ1 = µA + µB µAB.
For ideal mixtures, the chemical potentials µi are given by
![]() | (10) |
.
Because we assumed thermodynamic independence of the reactions, the entropy production density
[S] of each single reaction is necessarily positive or zero. This means that a reaction can only proceed if the Gibbs energy of its products is smaller than the Gibbs energy of the educts. Thus for every reaction it holds that sign
µ = sign J. This condition assures observance of the second law (
[S]
0) and detailed balance (Jeq = 0 for equilibrium with
µeq = 0). In analogy to electrical theory where electrical resistances are the ratio of voltage drop and current, there exists a positive thermodynamic resistance
with
. So,
µ may be interpreted as the thermodynamic force driving flux J through resistance
. Unfortunately, not even for the simplest kinetic laws is the resistance
constant; instead, it takes a complicated nonlinear form containing polynomial and logarithmic terms. E.g., the resistance of reaction 1 in Eq. 1 is
. Observe, that the zero sets of numerator and denominator of
are equal, since
. Thus the expression contains removable singularities.
The characterization of a reaction kinetic by its resistance
is possible, but due to its complex form, unpracticable. Hence this representation is usually only used in its linearized form in the vicinity of thermodynamic equilibrium. Since biology usually studies systems far from equilibrium, it is quite unsuited for modeling.
| RESULTS |
|---|
|
|
|---|
![]() | (11) |
We call
i the thermokinetic potential of Xi. Because it is derived from µi, the thermokinetic potential
i also describes the potential of a reactive species Xi. A nice property of
i is that for ideal mixtures (see Eq. 10) it is proportional to ci:
![]() | (12) |
The factor Ci [mol m3] is the capacity of i. It depends on the chemical standard potential µi,0:
![]() | (13) |
The Gibbs reaction energies can be written in dependence on the
i, e.g., for reaction 1 in the example network it is
µ1 = R* T (log(
A
B) log(
AB)).
An interpretation of the above-introduced quantities Ci and
i can be gained by considering thermodynamic equilibrium states with µeq,
eq, and ceq. A reaction is in thermodynamic equilibrium, if
µeq = 0. Thus the condition for thermodynamic equilibrium of the whole reaction system is in matrix notation
![]() | (14) |
This means that the capacities determine the equilibrium constants:
![]() | (15) |
Forces and resistances
By a redefinition of the thermodynamic force along a reaction, we can achieve a constant resistance for simple mass-action kinetics. The key property of
µ that has to be conserved is its sign distribution in dependence on the concentrations. We may easily construct a quantity F with sign(F) = sign(
µ) by applying the exponential function to both terms in Eq. 9,
![]() | (16) |
![]() | (17) |
![]() | (18) |
. We call F the thermokinetic force along a reaction. The according resistance R [mol1 m3 s] is defined by
![]() | (19) |
A nice property of R is that it is constant for mass-action kinetics. A comparison of coefficients of the kinetic rate law in Eq. 8 and the relation J = R1 F yields that
![]() | (20) |
Both expressions for R are equivalent, because the capacities Ci implicitly define the equilibrium constants (Eq. 15). For reaction 1 in the example network it holds that
.
Non-mass-action kinetics
The formalism can easily be extended to non-mass-action rate laws. Consider the generalized mass-action rate law
![]() | (21) |
. This is an arbitrary, positive function of c. Thus for a non-mass-action kinetic the resistance is no longer constant.
We will demonstrate this for the single reaction
with a reversible Michaelis-Menten kinetic with enzyme concentration cE:
![]() | (22) |
This is a generalized mass-action kinetic with k+ = 1/kX,0, k = 1/kY,0, and f(c, q) = cE/(1 + cX/kx,1 + cY/kY,1) with q = (kX,1, kY,1). With an appropriate scaling one can use the ki,0 as capacities Ci = ki,0. The corresponding resistance is R = (1 + cX/kX,1 + cY/kY,1)/cE or R = (1 + kX,0/kX,1
X + kY,0/kY,1
Y)/cE. This may be written as R =
0 +
X
X +
Y
Y where
i are new parameters. Thus for complex kinetics the resistances are not constant, but are dependent on the concentrations of educts and products. For typical kinetic laws of catalyzed reactions the dependency is polynomial or rational.
For modeling reaction networks, irreversible kinetic laws, with an always-positive reaction flux, are often used. An example is the Michaelis-Menten kinetic J = Jmax cS/(km + cS) where cS is the substrate concentration and Jmax and km are parameters. Physically, however, there are no such irreversible reactions because, according to the principle of microscopic reversibility, the direction of any reaction can be reversed by providing high product and low educt concentrations. Consequently, irreversible kinetic laws cannot be modeled with the proposed formalism, because they are thermodynamically infeasible. Reactions appear to be irreversible, if the potentials of the educts
are always much larger than those of the products
. Thus the apparent irreversibility of a reaction is a systemic property and not a property of a single reaction. When gaining for a physically sound model, the irreversibility should follow from the complete model equations and not be modeled explicitly.
Let us review what we gained by defining thermokinetic potentials
i (Eq. 11) and forces Fj (Eq. 17). Irreversible thermodynamics uses chemical potentials µi and negative Gibbs reaction energy
µ for the description of a reaction. This, however, leads to complex expressions for the resistances
. By introducing the thermokinetic force F, we heavily simplified the resistance R, which is now constant for mass-action kinetics and polynomial or rational for kinetics of catalyzed reactions. To fulfill the constraints imposed by energy conservation the modeler only has to assure that R is positive. To be able to conveniently express the thermokinetic force F, we introduced the thermokinetic potentials
i. The thermokinetic force of a reaction is the difference of two power law expressions in the
i whose exponents are the stoichiometric coefficients. Thermokinetic potentials
i and concentrations ci are proportional, such that a constant capacity relating both can be defined.
Model equations
With the above definitions we can write a mathematical model of a reaction system as
![]() | (23) |
By N we denote the stoichiometric matrix and by c and J the vectors of concentrations and reaction fluxes, respectively. We will illustrate this by writing down the model equations of our example (Eq. 1). They consist of the mole balances already formulated in Eq. 3, the relations between thermokinetic potentials and concentrations
![]() | (24) |
![]() | (25) |
An analogy to electrical networks becomes obvious. The concentrations ci are the chemical counterpart of electrical charges. They are the integral of the incoming flux and current, respectively. The thermokinetic potentials
i are analogs of the electrical voltages. By the capacities Ci they are linked to concentrations and charges, respectively. Forces Fj correspond to voltage drops along resistances Rj. However, whereas electrical forces are always linear in the potentials, thermokinetic forces may be nonlinear, since for reactions of higher order the forces Fj follow a nonlinear mass-action law.
The three parts of Eq. 23 correspond to the basic constraints that determine the behavior of a reaction network. The first part models the constraints introduced by mass conservation. The parameters are collected in the stoichiometric matrix N. In the second part the position of thermodynamic equilibrium is defined by the capacities Ci. Implicitly, for given concentrations this determines the signs of the reaction fluxes consistent with energy conservation and the second law. The third part represents the kinetic part: it describes the behavior of the flux in nonequilibrium. Its parameters are the resistances Rj.
Ordinary differential equations
Above we stated the model equation in the form of a differential-algebraic equation system in the variables c,
, and J. For practical reasons it is often advantageous to use a representation as an ordinary differential equation system. Simple operations allow the representation of J as a function of
or c, respectively. Thus it is possible to reduce the differential-algebraic system to an ordinary differential system in
or c, respectively. In the latter case, the only difference to models of the traditional kinetic modeling formalism is that the parameters are Ci and Rj and not k±j.
Open systems
Usually kinetic models do not describe closed systems that actually approach thermodynamic equilibrium. The most common assumption is that several concentrations or fluxes are clamped, i.e., are fixed to a specified value. As discussed in the Introduction, this is often done with ATP, ADP, and P concentrations. When clamping these concentrations in an ATP consuming network, one implicitly assumes the existence of an attached unmodeled system that rephosphorylates the produced ADP. In TKM models clamping can be done analogously, by fixing the respective quantities. The resulting model is thermodynamically feasible, since internal cyclic fluxes are not possible. However, it does not go to thermodynamic equilibrium, since a continuous flow through the system occurs.
It is also possible to model parts of the network in the TKM formalism and parts in the traditional kinetic modeling formalism. Then some of the reaction fluxes Jj are not determined by forces and resistances, but by a traditional concentration-dependent kinetic rate equation. This may be reasonable in order to include subnetworks with partly unknown stoichiometry.
Parameters and their values
Degrees of freedom
Although capacities and resistances are well-defined physical parameters, their measurement may be difficult. The capacities Ci depend on the chemical potentials of the pure compounds µi,0, but µi,0 cannot easily be determined absolutely. However, the key property is that the capacities define equilibrium concentrations and in this way the equilibrium constants. Any set of equilibrium concentrations ci,eq of the network may serve as capacities Ci = ci,eq and still the condition sign F = sign J is fulfilled (see Eq. 18) and thus positive resistances Rj exist. The different possible choices of the parameters change the scaling of the thermokinetic potentials
i and the forces Fj, but leave the concentrations ci and fluxes Ji invariant.
One may ask the question: How many degrees of freedom exist for the choice of the capacities? For given initial conditions, thermodynamic equilibrium is unique and two initial conditions lead to the same equilibrium state, if stoichiometrically possible. Thus the number of independent conservation relations dn that restrict the stoichiometric abilities of the network (e.g., cA + cAB + cABC = const) is equal to the number of degrees of freedom for the choice of capacities and resistances. To formalize this we reformulate the equilibrium condition in Eq. 14. The vector ceq is a vector of equilibrium concentrations and thus can be used as an alternative capacity vector, if
![]() | (26) |
These are rank(N) conditions for n concentrations ci and thus there are dn = n rank(N) degrees of freedom to choose new capacities ceq.
After determining the capacities Ci the resistances Rj can be identified from dynamic measurement data or from a comparison of coefficients with a known kinetic law.
Example
To illustrate the determination of capacities and resistances we will compute them for the example network in Eq. 1 using the parameters given in Kholodenko et al. (19
) for the complexation of phosphorylated Shc (A), Grb2 (B), and Sos (C):
![]() | (27) |
First we determine equilibrium concentrations for A, B, C, AB, BC, and ABC to use them as capacities. Observe that for this we have three degrees of freedom, because the system contains three conservation relations (dn = 3), one for each conserved moiety A, B, and C, respectively. For determining the capacities we can freely choose their total concentrations. To arrive at meaningful values we use the total concentrations given in Kholodenko et al. (19
) for the total concentrations of Shc, Grb2, and Sos:
![]() | (28) |
With the above values we compute the steady state
. Up to rounding errors this is an equilibrium state with J1 = J2 = J3 = J4 = 0, because the detailed balance relations are fulfilled. The equilibrium concentrations can be used as capacities:
![]() | (29) |
Using Eq. 24 we now can determine the resistances Rj by a comparison of coefficients of Eq. 25 and Eq. 4:
![]() | (30) |
Observe that the two alternatives for calculating the resistances Rj in the above equation are only equivalent, since the capacities Ci define a thermodynamic equilibrium. The kinetic parameters k±j of a model violating detailed balance cannot be converted into capacities Ci and resistances Rj.
A closer look at the above-computed capacities and resistances reveals large differences of their values. The low value of CBC means that an accumulation of BC is thermodynamically unfavorable, because a high concentration leads to very high
BC. This strongly activates BC-consuming reactions. The high value of R3 indicates that additionally the formation of BC is kinetically unfavorable. This suggests that the complexation usually proceeds in the order
. Similar considerations can be made on the basis of the parameters k+j and kj. There, however, the split into thermodynamic and kinetic reasoning is difficult. The above considerations are confirmed by a numerical simulation (Fig. 1): Because of our choice of the capacities as the equilibrium state that is actually approached, all thermokinetic potentials
i approach 1. Concentration cBC is very low, but its thermokinetic potential
BC is comparable to the thermokinetic potential
AB. Force F3 is quite high, but J3 is very small.
|
Parameter numbers
Let us compare the number of parameters of the traditional and the new TKM formalism. The example-model requires eight parameters in the traditional formalism (two for each reaction), but 10 in the TKM formalism (six capacities + four resistances). The eight parameters in the traditional formalism are constrained by one detailed balance equation. That is, there are seven independent parameters. To choose the 10 parameters of the TKM formalism we have three degrees of freedom. Thus there are effectively seven independent parameters. To compare the parameter numbers in general (see Table 1), we consider a network consisting of n compounds and m reactions with mass-action type kinetics. The left and the right null space of N have dimensions dn = n rank(N) and dm = m rank(N), respectively. This means there are dn conservation relations and dm independent cycles. The usually applied formalism requires 2 m formal parameters, one for each forward k+j and backward kj reaction. These parameters are partly related to each other by dm detailed balance relations. Thus there are 2 m dm = m + rank(N) independent parameters. Our formalism requires m + n parameters. Every compound has a capacity Ci and every reaction has a resistance Rj. Since usuallyin contrast to the examplethe number of reactions m exceeds the number of compounds n, m + n is normally <2 m. Additionally every conservation relation adds an degree of freedom to the choice of parameters. Thus the number of independent parameters for a network consisting of mass-action type reactions is here, de facto, equal to m + n dn = m + rank(N).
|
Merging models
The nonuniqueness of the parameters has to be observed, when merging two TKM models into a single model. The parameters Ci and Rj of the two models can only be combined directly, if the capacities refer to a compatible equilibrium state. Otherwise the parameters have to be converted accordingly (see Table 1). A simpler way to merge two models is to first convert the Ci and Rj to the rate parameters k±j. Observe, that the k±j are not constant, if Rj correspond to a non-mass-action kinetic. Then the k±j of the two models can be merged without problems. The resulting model can be converted back to the TKM formalism. Necessary conversion laws are listed in Table 1.
Comparison to Yang et al. (17)
In Yang et al. (17
) a method for imposing detailed balance in complex reaction networks is introduced. Considering Markov models of single molecule dynamics and a formal representation of general mass action kinetics, they arrive at an elegant formulation of the detailed balance relations and use those to suggest an alternative parameterization for kinetic models. They suggest parameterizing kinetic models by the equilibrium concentrations ceq,i and the unidirectional equilibrium rates
. Here, motivated from the formalism of irreversible thermodynamics and its notion of potentials and forces, we arrive at a parameterization with capacities Ci = ceq,i and resistances
(see Eq. 20). Our approach provides an intuitive, thermodynamic interpretation of these parameters. We discuss their relation to thermodynamic potentials and forces. The use of thermokinetic potentials
i and forces Fj provides a physically sound but intuitive way for the notation of model equations. The formalism is applicable to non-mass-action kinetics.
| DISCUSSION |
|---|
|
|
|---|
In the following we first discuss cases where a violation of detailed balance is critical. Then we review the TKM formalism and show how it may be useful in such cases.
Examples of large models
Recently, several large models of biochemical networks were developed that consider the complete stoichiometry of the modeled reactions. The behavior of such networks is constrained by thermodynamics.
In the literature (23
,24
) an approach for modeling signal transduction systems is presented that accounts for the whole combinatorial complexity emerging from the high number of different possible protein complexes. This approach is extended in Conzelmann et al. (25
) to reduce the number of necessary model equations. In addition to futile cycles, networks of protein complexation contain many true cycles (see Eq. 2) as well, and thus their parameters are subject to detailed balance relations. This issue is crucial for a correct parameterization of signal transduction networks. However, it is not explicitly discussed in the above-mentioned publications and no strategies are offered to come to a valid parameterization. We expect that the behavior of signal transduction networks is constrained by thermodynamics and that some of their design principles can only be understood from this viewpoint.
For several large metabolic networks the complete stoichiometry of each reaction is available mainly due to the efforts of the group around Palsson (Escherichia coli (26
), Helicobacter pylori (27
), Staphylococcus aureus (28
), Methanosarcina barkeri (29
), and Saccharomyces cerevisiae (30
)). These models are mainly used in the context of constraint-based modeling (31
) (e.g., flux balance and elementary mode analysis). In future they may serve as the basis of kinetic models. The power of constraint-based modeling shows that the behavior of metabolic networks is strongly determined by their stoichiometry and by the constraints of mass and energy conservation (3
). Thus for their kinetic modeling it is essential to explicitly consider the energy balance constraints.
Modeling, identification, and analysis
Models, as described above, contain many true cycles. Additionally, their kinetic parameters are often not known exactly. Thus if no special care is taken, the actual chosen parameter set very likely violates detailed balance relations and the model may show a behavior that is physically impossible. The situation gets more involved, if one considers parameter variations.
For parameter identification often numerical optimization methods are used. The model parameters are varied to find the parameter set that leads to the best fit of model behavior and measurement data. If detailed balance relations are not explicitly considered, the optimization method will spend most of the time examining impossible parameter sets. If some parameters are not identifiable (e.g., due to noisy or missing measurement data) the best fit is likely to be attained at infeasible parameter values. On the other hand if the detailed balance relations are considered, less measurement data is necessary to identify the parameters uniquely.
Model analysis often involves sensitivity or robustness analysis, i.e., the effects of small parameter perturbations are studied. This is also critical, because one has to take care to exclude thermodynamically infeasible parameter combinations. Otherwise, impossible parameter variations are considered and the result is biased.
The goal of systems biology is to gain a holistic understanding of biological systems. In particular, this means we must understand how biological systems function under the unalterable constraints imposed by basic physical laws. It is desirable that the model equations fulfill these constraints structurally, i.e., that with any choice of the kinetic parameters, the model describes a physically feasible system. Whereas in traditional kinetic modeling the mole balance equations structurally account for mass conservation, energy conservation has to be assured by an appropriate choice of the kinetic parameters.
TKM formalism
We introduced a novel formalism called thermodynamic-kinetic modeling (TKM) for building kinetic differential equation models of reaction networks that structurally account for energy conservation. For this purpose the full stoichiometry of the network has to be known. The formalism explicitly considers driving forces of reactions. The driving forces are directed toward thermodynamic equilibrium and thus toward lowering the Gibbs energy in the system. The simple condition that flux and force are always directed into the same direction assures the existence of thermodynamic equilibrium and thus the observance of energy conservation. Our method differs from former approaches in the choice of potentials and forces. Traditionally the chemical potentials µi and consequently the negative Gibbs reaction energies
µj are used. In Heinrich and Schuster (13
) it is discussed that thermodynamic flow-force relationships of this kind are often inferior to kinetic rate equations. Kinetic rate equations allow us to incorporate detailed mechanistic knowledge and thus give deeper insight. Further, thermodynamic forces
µj do not contain the full information that determines the reaction rates. A force
µ depends only on the reaction quotient
and thus is invariant to variations of concentrations that do not change the reaction quotient. However, reactions rates are usually not invariant to such variations. An example will clarify this: If the concentrations cX and cY in the reaction
double, the reaction quotient cY/cX and thus
µ remains equal. However, if we assume mass-action kinetics the reaction rate J = k+ cX k cY doubles. This scaling information is not present in
µ and has to be captured by the dependence of the resistance
on the concentrations ci. Thus the use of
µ as a force for reactions far from equilibrium is unpracticable, since the according resistancethe quotient of force and fluxis highly nonlinear.
We construct the thermokinetic force F that has the same sign distribution as
µ, but is proportional to mass-action reaction rates. Thus the resistance R = F/J of a mass-action reaction is constant. The dependence of the resistance on concentrations has to capture only effects that come from a deviation from ideal mass-action behavior. The thermokinetic force F can be conveniently expressed using thermokinetic potentials
i of educts and products. They are the ratios of concentrations ci to constant capacities Ci. Any set of equilibrium concentrations may serve as capacities Ci.
The TKM formalism provides an adequate way to introduce the thermodynamic concepts of potentials and forces to kinetic modeling and thus resolves the dilemma whether to use thermodynamic flux-force relationships or kinetic rate equations for describing biochemical reaction networks.
If, as it is the case for most natural reaction networks that the number of reactions exceeds the number of compounds, then the TKM formalism will require fewer parameters than the traditional formalism and thus simplify parameter identification and sensitivity analysis (see Table 1). A slight complication is caused by the nonuniqueness of capacities. Parameter values cannot be determined uniquely, but different parameter sets are equivalent with respect to the description of concentrations and fluxes. The different choices affect only the scaling of potentials and forces. In praxis the nonuniqueness is not an obstacle for using the TKM formalism, because the different possible choices can be easily converted.
TKM of large metabolic systems
To demonstrate the use of the TKM formalism, we consider a hypothetical, kinetic model of the metabolism of E. coli K-12 iJR904 with stoichiometry as published in Reed et al. (26
). The following computations were made with Mathematica (32
); for the code see the Supplementary Material. The network contains n = 762 compounds and m = 931 metabolic reactions plus one reaction describing cell growth. The stoichiometric matrix N
R762x932 has rank(N) = 722. Thus there are dm = 210 independent cycles in the network, corresponding to 210 independent Wegscheider conditions (see Table 1). A closer look reveals cycles, that contain phosphorylation of ADP, e.g.,
![]() | (31) |
R932x210 with N B = 0 using exact arithmetics. Since the kernel matrix is not unique and kernel matrices with a lot of zero entries will give shorter cycles, we bring B to column-reduced echelon form. Then
99% of its elements are zero. The 210 columns of B correspond to 210 independent cycles. This means that 23% of the equilibrium constants cannot be adjusted freely, but are determined by Wegscheider conditions. Cycles contain between 2 and 49 reactions (mean 9.2). A reaction participates, on average, in 2.1 cycles (minimum 0, maximum 107). The number of reactions that do not participate in any cycle is equal to the number of zero rows in B and thus is independent of the actual choice of B. Only 48% of the reactions do not participate in any cycle. This means that changes in 52% of the equilibrium constants in the model have side effects on other equilibrium constants, possibly distributed over several functional units. This makes it very difficult to assess the effect of parameter changes during modeling. Let us assume that the network is modeled by reversible, generalized mass-action kinetics (see Eq. 21). The number of parameters in the traditional formalism is dtrad = 1864 plus the parameters describing the deviation from ideal mass-action behavior. In the TKM formalism we have dTKM = 1694 parameters plus the parameters describing the dependence of the resistances on the concentrations. The de facto number of mass-action parameters is d = 1654 (see Table 1). This means that the traditional formalism requires 13% and the TKM formalism 2.4% more mass-action parameters than minimally required. Further 40 capacities can be chosen freely.
TKM of large signal transduction systems
Let us assume a scaffold protein that binds k different ligands at k different binding sites. This is a typical motif in many signal-transducing pathways, where the ligands are signaling proteins, kinases, or phosphatases. Observe that the example network in Eq. 1 is of this kind with k = 2. Every binding site may be occupied or unoccupied. Thus there are 2k possible complexes of the scaffold protein and the number of all compounds in the system is n = 2k + k. The number of different reactions per ligand is 2k1 and thus the number of all reactions is m = 2k k/2. There are dn = k + 1 conservation relations and thus the number of cycles is dm = (k/2 1) 2k + 1 and the number of de facto parameters is d = (k/2 + 1) 2k 1 (see Table 1). In Table 2 cycle numbers and parameter numbers are compared. The ratio of equilibrium constants that can be determined by Wegscheider conditions (dm/m) grows with k. Already at k = 4 more than half of the equilibrium constants are determined by the rest. When comparing the minimal number of de facto mass-action parameters d with the number of mass-action parameters necessary for a model in the traditional or the TKM formalism, we get the over-parameterization of the respective formalisms. At k = 1 the traditional formalism has the minimal number of parameters and the TKM formalism needs twice as many parameters. But already at k = 3 the use of the TKM formalism is advantageous in terms of parameter numbers.
|
TKM and model parameterization
A major feature of the TKM formalism is thatin contrast to the traditional formalismthe parameters of TKM are naturally associated with compounds and reactions: The capacities Ci determine the equilibrium of the network. Low capacities Ci indicate a high free energy content of the respective compound. An accumulation of this compound is energetically unfavorable.
Capacities are thermodynamic quantities based on the standard chemical potentials that are organism-independent. The standard Gibbs energies of formation of many chemical species are tabulated (33
). For biochemical reactions it is of advantage when working with reactants that are sums of species (e.g., ATP = ATP4 + HATP3 + MgATP2). For reactants, standard transformed Gibbs energies of formation can be computed and are tabulated (34
). These numbers may be used as chemical standard potentials of the respective species or reactants and thus capacities can be directly calculated. Such tables are particularly useful for the modeling of metabolic networks.
Resistances Rj and capacities Ci determine the behavior in nonequilibrium. Low resistances Ri indicate that a reaction proceeds very fast, if its not in equilibrium. Resistances in metabolic networks depend on the characteristics of the catalyzing enzymes and thus are organism-dependent.
The only thermodynamic condition on the new parameters Rj and Ci is their positiveness.
TKM and sensitivity analysis
Consider a sensitivity analysis of the steady-state of the example network in Eq. 1. Remember, that the steady-state of the example is thermodynamic equilibrium. The interpretation of the steady-state sensitivity matrices
css/
k and
Jss/
k is difficult, since variations of the eight parameters k±j are restricted to a seven-dimensional nonlinear set. In particular it is not easily visible, that the sensitivity of the steady-state fluxes Jss is zero. When one explicitly considers the Wegscheider condition, the variations of a reduced parameter vector k'
R7 are studied. The eighth parameter is determined by the Wegscheider condition. Then
Jss/
k' = 0, but the matrix
css/
k' is hard to interpret, since a variation of an element of k' is not exclusively associated with a reaction. In the TKM formalism it is
css/
R = 0, reflecting the fact, that the equilibrium state is independent of the reaction mechanisms. Further
Jss/
C = 0 and
Jss/
R = 0, since in thermodynamic equilibrium fluxes vanish. Only the matrix
css/
C is not a zero matrix, since the equilibrium composition is determined by the capacities Ci. This example shows that a correct interpretation of sensitivities with respect to mass-action parameters k±j may be difficult. Variations of capacities Ci and resistances Rj provide a natural way to study the sensitivity of reaction networks.
In systems biology one faces the problem of mathematical modeling of large reaction networks. Often their structure expressed by the stoichiometric matrix is completely known, but initially only insufficient information on kinetic parameters is available. In such cases kinetic modeling and model analysis is complicated by the need to observe detailed balance relations. By using thermokinetic potentials and forces the TKM formalism structurally avoids violation of detailed balance. Additionally, fewer parameters are required. Thus it supports the modeler in building physically sound models.
| SUPPLEMENTARY MATERIAL |
|---|
|
|
|---|
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
M.E. acknowledges support from the Landesstiftung Baden-Württemberg and the Bundesministerium für Bildung und Forschung.
Submitted on July 28, 2006; accepted for publication November 27, 2006.
| REFERENCES |
|---|
|
|
|---|
2. Olivier, B. G., and J. L. Snoep. 2004. Web-based kinetic modeling using JWS Online. Bioinformatics. 20:21432144.
3. Stelling, J., S. Klamt, K. Bettenbrock, S. Schuster, and E. D. Gilles. 2002. Metabolic network structure determines key aspects of functionality and regulation. Nature. 420:190193.[CrossRef][Medline]
4. Onsager, L. 1931. Reciprocal relations in irreversible processes I. Phys. Rev. 37:405426.[CrossRef]
5. Onsager, L. 1931. Reciprocal relations in irreversible processes II. Phys. Rev. 38:22652279.[CrossRef]
6. Yang, F., H. Qian, and D. A. Beard. 2005. Ab initio prediction of thermodynamically feasible reaction directions from biochemical network stoichiometry. Metab. Eng. 7:251259.[CrossRef][Medline]
7. Qian, H., D. A. Beard, and S. dan Liang. 2003. Stoichiometric network theory for nonequilibrium biochemical systems. Eur. J. Biochem. 270:41521.[Medline]
8. Qian, H., and D. A. Beard. 2005. Thermodynamics of stoichiometric biochemical networks in living systems far from equilibrium. Biophys. Chem. 114:213220.[CrossRef][Medline]
9. Price, N. D., I. Famili, D. A. Beard, and B. O. Palsson. 2002. Extreme pathways and Kirchhoff's second law. Biophys. J. 83:28792882.
10. Beard, D. A., S. dan Liang, and H. Qian. 2002. Energy balance for analysis of complex metabolic networks. Biophys. J. 83:7986.
11. Beard, D. A., E. Babson, E. Curtis, and H. Qian. 2004. Thermodynamic constraints for biochemical networks. J. Theor. Biol. 228:327333.[CrossRef][Medline]
12. Beard, D. A., and H. Qian. 2005. Thermodynamic-based computational profiling of cellular regulatory control in hepatocyte metabolism. Am. J. Physiol. Endocrinol. Metab. 288:E633E644.
13. Heinrich, R., and S. Schuster. 1996. The Regulation of Cellular Systems. Chapman & Hall, New York.
14. Horn, F., and R. Jackson. 1972. General mass action kinetics. Arch. Ration. Mech. Anal. 47:81116.
15. Feinberg, M. 1972. Complex balancing in general kinetic systems. Arch. Ration. Mech. Anal. 49:187194.
16. Colquhoun, D., K. A. Dowsland, M. Beato, and A. J. R. Plested. 2004. How to impose microscopic reversibility in complex reaction mechanisms. Biophys. J. 86:35103518.
17. Yang, J., W. J. Bruno, W. S. Hlavacek, and J. Pearson. 2006. On imposing detailed balance in complex reaction mechanisms. Biophys. J. 91:11361141.
18. Qian, H. 2005. Cycle kinetics, steady state thermodynamics and motorsa paradigm for living matter physics. J. Phys. Condens. Matter. 17:S3783S3794.[CrossRef]
19. Kholodenko, B., O. Demin, G. Moehren, and J. Hoek. 1999. Quantification of short term signaling by the epidermal growth factor receptor. J. Biol. Chem. 274:3016930181.
20. Schoeberl, B., C. Eichler-Jonsson, E. D. Gilles, and G. Müller. 2002. Computational modeling of the dynamics of the MAP kinase cascade activated by surface and internalized EGF receptors. Nat. Biotechnol. 20:370375.[CrossRef][Medline]