Cristian Gratie | Ion Petre Complete Characterization for the FitPreserving Data Refinement of Mass-Action Reaction Networks TUCS Technical Report No 1128, January 2015 Complete Characterization for the FitPreserving Data Refinement of Mass-Action Reaction Networks Cristian Gratie Computational Biomideling Laboratory, Turku Centre for Computer Science and Department of Information Technologies, Åbo Akademi University Joukahaisenkatu 3-5 A, 20520 Turku, Finland [email protected] Ion Petre Computational Biomideling Laboratory, Turku Centre for Computer Science and Department of Information Technologies, Åbo Akademi University Joukahaisenkatu 3-5 A, 20520 Turku, Finland [email protected] TUCS Technical Report No 1128, January 2015 Abstract The systematic development of large biological models often relies on iteratively adding more details to an initial, simplified, abstraction of the modeled system. In data refinement, species of the original model are substituted with several variants (subspecies) in the refined one, each with its own individual behavior. In this context, one can distinguish between structural refinement, where the aim is to generate meaningful refined reactions, and quantitative refinement, where one looks for a refined model that better fits available experimental data. The latter is generally a computationally expensive process, as it requires refitting the model and sometimes additional data. It is possible to reuse parameter values from the previous iteration by constructing a fit-preserving refinement, i.e. one that captures the same species dynamics as the original model, but accounts for the newly introduced subspecies. We showed in previous work that fit-preserving refinements can be computed efficiently by providing simple linear constraints on the rate constants of the refined model, which are sufficient for fit-preservation. In this paper, we extend our result and provide a complete characterization of fit-preserving refinement, in the form of a necessary and sufficient condition, applicable for mass-action reaction networks with uniquely identifiable rate constants. Furthermore, we discuss the refinement of a simple model, the Brusselator, and show that, while constrained to capture the same species dynamics as the original model, fit-preserving refinements can still be very different with respect to the behavior of subspecies. Furthermore, our examples provide an intuition about how experimental data can be used to guide model fitting by identifying classes of qualitatively distinct behaviors. Keywords: biomodeling, canonical refinement, fit-preserving refinement, model fit, parameter estimation, quantitative model refinement TUCS Laboratory Computational Biomodeling Laboratory 1 Introduction The top-down development of large biological models relies on iteratively adding details to an initial abstraction of the biological phenomena. At each step, the numerical setup of the mathematical model that describes the dynamic behavior of the system needs to be fit against existing experimental data. Model fitting is often a computationally expensive process, thus it would be beneficial to reuse previously fit values to initialize the parameter estimation routines for the refined model. We focus here on reaction-based models that rely on mass-action kinetics [7]. Data refinement, introduced in [2], refers to the replacement of one (or more) species with several variants, or subspecies, in the refined model. The approach presented in [2] for the numerical setup of the refined model aims to preserve the fit of the original model and consists in assigning parameter values in such a way that the ODEs describing the original model can be recovered as a sum of ODEs from the refined model. We formalized this idea in [4] as fit-preserving refinement and provided a sufficient condition for preserving the numerical fit. The condition links the refined parameters to those of the original model without the need for inspecting the ODEs. The approach from [4] provides a partial answer to the open problem formulated in [3] of finding values for the unknown parameters of a partially specified refined model so that it preserves the numerical fit of the original model: as long as the known values do not already lead to a violation of the fit-preservation constraints, the problem has at least one solution, which can be effectively computed. Since the proposed condition is only sufficient, for some models it is possible to build fit-preserving refinements for which the parameter values do not satisfy the condition. We address this issue here and show that all such models are in fact “pathological” cases that do not have uniquely identifiable rate constants, a critical requirement for being able to fit the model with experimental data, see [1]. This paper is an extended version of [4]. In addition to [4], we include here the full proof of its main result, as well as all mathematical considerations leading to it. Moreover, we prove that the sufficient condition given in [4] is in fact a necessary and sufficient condition, i.e. a complete characterization of fit-preserving refinement, provided that the original model has uniquely identifiable rate constants. Finally, we illustrate our approach on a new case study, that of the Brusselator [9], to show that fit-preserving refinement alone can give rise to very different refined models. The paper is structured as follows. Section 2 provides an introduction to chemical reaction networks and the main formal results related to uniquely identifiable rate constants and solutions of ordinary differential equations. In Section 3 we formally discuss fit-preserving refinement and prove the main 1 result of this paper. We apply our approach to the study of a simple example, the Brusselator [9], in Section 4, where we analyze four different refinements of the model. We discuss the implications of our result in Section 5. 2 Preliminaries We first fix some notations used throughout the paper. We denote by N the set of non-negative integers and by R≥0 the set of non-negative real numbers. For two sets X, Y we denote by X Y the set of mappings f : Y → X; for a finite set Y , X Y can also be seen as the set of vectors of dimension |Y | with elements from X. Throughout this paper we will always denote vectors with a lower-case bold-faced letter. We will use bold-faced upper-case letters to denote matrices or functions that have multiple inputs and outputs. 2.1 Solutions of Autonomous Ordinary Differential Equations Let F : Rn → Rn be a continuously differentiable function. We focus on the solutions of the following autonomous ordinary differential equation (ODE): ẋ = F (x) (1) with the initial condition x(t0 ) = x0 . Following common practice in the literature, we will also call this an initial value problem (IVP). We can also understand equation (1) as a system of differential equations if we consider the components of x and F (x) separately. A solution of such an equation is a function x : I → Rn , where I ⊆ R is an interval containing t0 , such that x is differentiable and, moreover, ẋ(t) = F (x(t)), for all t ∈ I. It is well known that such IVPs have a unique solution in the neighborhood of t0 , as long as F satisfies some reasonable assumptions, for example see [6]. Since our work involves the careful manipulation of these ideas, we provide (without proof) formal results for both the existence and uniqueness in what follows. First, let us see that if we are given a solution x of equation (1) with the given initial condition, then the function x(t − t0 ) satisfies the same differential equation, with the initial condition specified at 0, i.e. x(0) = x0 . Thus, it suffices to only consider initial conditions specified for t0 = 0. Theorem 1. Let F : Rn → Rn be a continuously differentiable function and x0 ∈ Rn . Then there exists a closed interval [−a, a] and a unique function x : [−a, a] → Rn such that ẋ(t) = F (x(t)) for all t ∈ [−a, a] and x(0) = x0 . For a complete proof of Theorem 1, the reader may see [6]. The uniqueness applies also for extensions of the solution to larger intervals. In particular, we can consider the maximal time domain for which a solution exists and 2 refer to the corresponding solution as the solution of the equation for the given initial condition. Definition 1. Let F : Rn → Rn be a continuously differentiable function. For every α ∈ Rn , we will use a[α] to refer to the unique, maximal (with respect to its time domain) solution of the differential equation ȧ = F (a), with a(0) = α. We will use dom(a[α]) to denote the domain of a[α]. Note that the time domain of a solution a[α] may also depend on the actual value of α. 2.2 Reaction Networks We consider in this paper that all reactions are irreversible; any reversible reaction is replaced by its “left-to-right” and “right-to-left” irreversible reactions. We formalize in the following the notion of a reaction, using both a rewriting rule style, and a vectorial style. We first define the notions of species and complexes. Definition 2. Let S = {S1 , . . . Sm } be a finite set whose elements we call species. A vector in NS is called a complex over S . If c = [c1 , . . . , cm ]T is a complex over S , then we say that ci is the multiplicity (or, equivalently, the stoichiometric coefficient) of species Si in complex c, for all 1 ≤ i ≤ m. Note that our notion of complex refers to a linear combination of species that may occur on either side of a reaction. It should not be confused with the concept of a chemical complex, which would be represented in our notation through a single species. We define now reactions and reaction networks. Definition 3. Let S = {S1 , . . . , Sm } be a set of species. A reaction r over S is a pair of complexes c, d over S , r = (c, d). If c = [c1 , . . . , cm ]T and d = [d1 , . . . , dm ]T , reaction r is usually written in the style of a rewriting rule as r: m X ci Si → i=1 m X di Si i=1 For the sake of a compact notation, we will also use, following [1], a vectorial notation: r:c→d To a reaction r we can associate a kinetic rate constant kr ∈ R≥0 ; the kinetic rate constant is usually indicated on top of the arrow in both conventions for writing a reaction. 3 We note that even though we indicate the multiplicity of all species, both on the left-hand side, and on the right-hand side of a reaction, in practice most of the stoichiometric coefficients are zero. Indeed, chemical kinetics show (see, e.g., [8]) that reactions where the sum of multiplicities on their left-hand side is at least three have such a low kinetic rate constant that their effect is negligible. Definition 4. A reaction network is a pair N = (S , R), where S is a finite set of species, S = {S1 , . . . , Sm }, and R is a finite set of reactions over S , R = {r1 . . . , rn }, where: rj : m X cij Si → i=1 m X dij Si , 1 ≤ j ≤ n. i=1 We also write each reaction through its vectorial convention: rj : cj → dj , 1 ≤ j ≤ n, where cj = [c1j , . . . , cmj ]T and dj = [d1j , . . . , dmj ]T . A mass-action reaction network is a tuple M = (S , R, k), where (S , R) is a reaction network and k ∈ RR ≥0 ; kc→d is called the reaction rate constant of reaction c → d ∈ R. The system of ODEs associated to a mass-action reaction network is constructed as follows. We associate to each species Si ∈ S a real function si : R≥0 → R≥0 , whose intended interpretation is the time-dependent concentration of Si . These functions are defined, in the case of mass-action kinetics, through the following system of ODEs: ṡi = n X (dij − cij )kj m Y scqqj , 1 ≤ i ≤ m. q=1 j=1 We also write this system of ODEs in a compact, vectorial style as ṡ = X kc→d sc (d − c), (2) c→d∈R ci where s = [s1 , . . . , sm ]T , ṡ = [ṡ1 , . . . , ṡm ], and sc = m i=1 si . Note that (2) is an autonomous ODE; let F (s) be its right hand side. Since in the case of mass-action kinetics the function F is a multinomial with respect to the components of s, it is continuously differentiable and, thus, the existence and uniqueness of the solution s[s0 ] is guaranteed by Theorem 1, for any initial values s0 ∈ RS . Q 4 2.3 Uniquely Identifiable Rate Constants Let M = (S , R, k) be a mass-action reaction network. We introduce, in the spirit of [1], the following notation for the right-hand side of a mass-action system (2), using the vectorial style notation: r[R, k](α) = X kc→d αc (d − c), c→d∈R for all α ∈ RS ≥0 . Definition 5. [1] A reaction network N = (S , R) is said to have uniquely identifiable rate constants if, for any two distinct rate constant vectors k1 , k2 ∈ RR ≥0 , it holds that r[R, k1 ] 6= r[R, k2 ]. In words, the fact that a reaction network has uniquely identifiable rate constants simply means that two distinct rate constant vectors cannot give rise to exactly the same ordinary differential equation. From a model fitting perspective, such a property is clearly desirable. In its absence, even very precise measurements would not enable one to find the appropriate rate constants, since several rate constant vectors can lead to exactly the same dynamic behavior. Note that the property from Definition 5 only depends on the set of reactions R. Thus, it is a structural property that ensures a different dynamical behavior for each possible reaction rate constant vector k. Definition 6. Let N = (S , R) be a reaction network. A complex c over S is a source complex in N if it is the left hand side of at least one reaction from R, i.e. there exists at least one complex d over S such that c → d ∈ R. Theorem 2. [1] A reaction network N = (S , R) has uniquely identifiable rate constants if and only if, for each source complex c of N , the reaction vectors {d − c | c → d ∈ R} are linearly independent. 3 Fit-Preserving Data Refinement The data refinement of a reaction network is about adding some details into a network, e.g. through replacing one or more species of the network with a set of subspecies carrying more detailed and potentially differentiated behavior. In the general setting, we assume to have two sets of species S and S 0 and a relation ρ ⊆ S × S 0 that links each species from S to its corresponding subspecies in S 0 . For each S ∈ S we denote ρ(S) = {S ∈ S 0 | (S, S 0 ) ∈ ρ}. The intuition of species refinement is formally captured in Definition 7. Definition 7. [4] Let S and S 0 be two sets of species. A relation ρ ⊆ S ×S 0 is a species refinement relation if and only if it satisfies the following conditions: 5 1. for each S ∈ S , ρ(S) 6= ∅; 2. for each S 0 ∈ S 0 there exists exactly one S ∈ S such that S 0 ∈ ρ(S). Intuitively, when ρ(S) = {S10 , . . . , Sr0 }, we mean that species S is refined and replaced in the refined model by its subspecies S10 , . . . , Sr0 . Each species from the original model should be refined to at least one species in the refined model (more than one in the case of non-trivial refinements) and each species of the refined model should correspond to exactly one “parent” species from the original model. A species refinement ρ can also be written as an (S × S 0 )-matrix with {0, 1} entries as follows: Mρ = (mS,S 0 )S∈S ,S 0 ∈S 0 , 1, if S 0 ∈ ρ(S) ; mS,S 0 = 0, otherwise . (3) By definition, there is at least a 1-entry on each row of the matrix and there is exactly one 1-entry on each of its columns. We will refer to Mρ as the characteristic matrix of the species refinement relation ρ. The notion of species refinement is extended to complexes, reactions, and reaction networks as follows. 0 Definition 8. Let S = {S1 , . . . , Sm } and S 0 = {S10 , . . . , Sm 0 } be two sets of species and ρ ⊆ S × S 0 a species refinement relation. 1. Let c = [c1 , . . . , cm ]T ∈ NS and c0 = [c01 , . . . , c0m0 ] ∈ NS be two complexes over S , respectively S 0 . We say that c0 is a ρ-refinement of c, denoted c0 ∈ ρ(c), if 0 X c0j = ci , for all 1 ≤ i ≤ m 1≤j≤m0 Sj0 ∈ρ(Si ) or, equivalently, relying on the characteristic matrix, if c = Mρ c0 . 2. Let r : c → d be a reaction over S and r0 : c0 → d0 a reaction over S 0 . We say that r0 is a ρ-refinement of r, denoted r0 ∈ ρ(r), if c0 ∈ ρ(c) and d0 ∈ ρ(d). 3. Let N = (S , R) and N 0 = (S 0 , R 0 ) be two reaction networks over S and S 0 , respectively. We say that N 0 is a ρ-refinement of N , denoted N 0 ∈ ρ(N ), if R0 ⊆ [ ρ(r) and ρ(r) ∩ R 0 6= ∅, for all r ∈ R. r∈R In case R 0 = S r∈R ρ(r), we say that N 0 is the full ρ-refinement of N . 6 4. Let M = (S , R, k) and M 0 = (S 0 , R 0 , k0 ) be two mass-action reaction networks over S and S 0 , respectively. We say that M 0 is a ρrefinement of M , denoted M 0 ∈ ρ(M ), if (S 0 , R 0 ) ∈ ρ(S , R). We say that M 0 is a full ρ-refinement of M if (S 0 , R 0 ) is the full ρ-refinement of (S , R). 5. Let σ ∈ RS and σ 0 ∈ RS (thought of as the initial values for the system of ODEs associated to M and M 0 ). We say that σ 0 is a ρrefinement of σ, denoted σ 0 ∈ ρ(σ), if σ = Mρ σ 0 . 0 The definition given above for the refinement of mass-action reaction networks is only structural, describing the relationship between their sets of species and reactions. We introduce now a quantitative counterpart for the notion of refinement for mass-action reaction networks: fit-preserving refinement. Definition 9. Let M = (S , R, k) and M 0 = (S 0 , R 0 , k0 ) be two mass-action reaction networks and ρ ⊆ S ×S 0 a species refinement relation. For any σ ∈ 0 0 0 S0 0 S0 S RS ≥0 and σ ∈ R≥0 we denote by s[σ] : [0, τ ) → R≥0 (s [σ ] : [0, τ ) → R≥0 ) the vector of the real functions obtained from the ODE system associated to M (to M 0 , respectively) with initial values σ (σ 0 , respectively). We say that M 0 is a ρ-fit-preserving refinement of M if M 0 ∈ ρ(M ) and, 0 S0 0 for all σ ∈ RS ≥0 and σ ∈ R≥0 such that σ = Mρ σ , we have that s[σ](t) = Mρ s0 [σ 0 ](t), (4) for all values of t in a suitable right-neighborhood of 0. We prove in our next result that having the fit-preserving condition (4) for non-negative initial values implies that the same condition holds even for arbitrary initial values. Even though in practice the actual values are usually chosen to be non-negative (they represent concentration levels) this observation is interesting in itself and potentially useful in formulating more general results about refinement. Moreover, we show that (4) holds for all values of t for which both s[σ] and s0 [σ 0 ] are defined. Theorem 3. Let M = (S , R, k) and M 0 = (S 0 , R 0 , k0 ) be two mass-action reaction networks and ρ ⊆ S × S 0 a species refinement relation. If M 0 is a 0 ρ-fit-preserving refinement of M , then for all σ ∈ RS , σ 0 ∈ RS such that σ = Mρ σ 0 we have that s[σ](t) = Mρ s0 [σ 0 ](t), for all values of t for which both functions are defined, i.e. t ∈ dom(s[σ]) ∩ dom(s0 [σ 0 ]). Proof. For all σ 0 ∈ RS ≥0 , we know from the definition of fit-preserving re0 finement that s[Mρ σ ](t) = Mρ s0 [σ 0 ](t), for t in a right-neighborhood of 0. Moreover, the equality holds for all values of t for which both functions 0 7 are defined, via Theorem 1, since Mρ s0 [σ 0 ] and s[Mρ σ 0 ] are solutions of the same IVP. Since the equality holds in a right-neighborhood of 0, it must be that also the corresponding rates at 0 are equal, i.e. ṡ[Mρ σ 0 ](0) = Mρ ṡ0 [σ 0 ](0), which leads to X c→d∈R c kc→d (Mρ σ 0 ) (d − c) − Mρ X c0 kc0 0 →d0 σ 0 (d0 − c0 ) = 0 (5) c0 →d0 ∈R 0 for all σ 0 ∈ RS ≥0 . But since the left hand side only consists of multinomials with respect to the components of σ 0 , it follows that (5) holds for any σ 0 ∈ RS . But then we can use the equality to show that Mρ s0 [σ 0 ] satisfies the 0 initial value problem of s[Mρ σ 0 ] for any σ 0 ∈ RS , which leads by Theorem 1 to the desired result. Note that the result of Theorem 3 relies both on the fact that mass-action ODEs are multinomials with respect to the species’ concentrations, and on the behavior of the refinement matrix Mρ . With the stronger requirements of Theorem 3 we aim to capture the maximal extent to which the refined and original models can describe the same dynamic behavior. The problem we focus on is how to effectively construct a fit-preserving refinement of a given mass-action reaction network M when given the species refinement relation ρ. The first part is to build the full structural ρ-refinement of M ; this can be done by constructing ρ(c) × ρ(d) for all reactions c → d of M . The second part is to set the kinetic rate constants of the refined model so that it yields a fit-preserving refinement. A difficulty in checking whether a given numerical setup of the full structural refinement yields a fit-preserving refinement is that the system of ODEs is non-linear and cannot be solved analytically in general. We recall the following problem of [3], with the formulation given in [4]. Problem 1. [3] Let M be a mass-action reaction network, ρ a species refinement relation, and M 0 a full structural ρ-refinement of M . Assuming that numerical values of some of the kinetic rate constants of M 0 are fixed, find a numerical setup for all its other kinetic rate constants so that M 0 is a fit-preserving refinement of M . In our previous work [4] we gave a partial answer to Problem 1 in the form of a sufficient condition on the kinetic rate constants that, if satisfied, would entail that the considered full structural refinement is fit-preserving. We now turn this result around and we introduce the class of full structural refinements satisfying the constraint as canonical refinements. This will allow us to formulate a stronger result, one that also discusses necessity in addition to sufficiency. 8 Definition 10. Let M = (S , R, k) and M 0 = (S 0 , R 0 , k0 ) be two massaction reaction networks and ρ ⊆ S × S 0 a species refinement relation. We say that M 0 is a canonical ρ-refinement of M if M 0 is a full ρ-refinement of M and, for every c → d ∈ R and every c0 ∈ ρ(c), we have that ! X d0 ∈ρ(d) kc0 0 →d0 c = 0 kc→d , where c ! Q|S | c i=1 ci ! . = Q|S 0| 0 0 c j=1 cj ! Note that the constraint from Definition 10 is not far from what one would expect. Indeed, the rate constants of all refined reactions that share the same left-hand side c0 depend on the rate constant of the parent reaction, kc→d , and on its left hand side c. The interesting aspect is the linear character of the dependency. Theorem 4. Let M = (S , R, k) and M 0 = (S 0 , R 0 , k0 ) be two reaction networks such that M 0 is a full ρ-refinement of M . 1. If M 0 is a canonical ρ-refinement of M , then M 0 is a fit-preserving ρ-refinement of M . 2. If M has uniquely identifiable rate constants, then M 0 is a fit-preserving ρ-refinement of M if and only if M 0 is a canonical ρ-refinement of M . Note that part 1 is the one we have already presented, without proof, in [4]. Here, we are going to give a unified proof of both results, one that will also highlight the important elements that distinguish the particular problem of refining mass-action reaction networks from a very broad notion of refinement, which is captured in Lemma 1. To see why the scenario presented in Lemma 1 is a very general form of fit-preserving refinement, assume that we are trying to describe the dynamics of a complex system, for which we have devised two models. One has n state parameters and is characterized by ȧ = F (a), while the other one has m parameters and satisfies ḃ = G(b). In order to compare the two models, we need a way to relate the solutions of the two ODEs. The key ingredient for that is a direct correspondence between the states of the two models, which is given by the function Φ. In this context, a[Φ(β)] = Φ ◦ b[β] essentially means that we expect that, given an initial value β in the second model, the solution a for the first model initialized at the image of β through Φ is the same as the image of the solution b initialized at β. This is very similar to the condition we imposed for fit-preserving refinement, with the observation that in our case the correspondence function Φ is simply the left multiplication with the species refinement matrix Mρ . 9 In what follows, for a given function Φ : Rm → Rn , we will use Φ0 (x) to denote the derivative of Φ at x, i.e. the linear operator defined by the Jacobian matrix: 0 Φ (x) = ∂Φ1 (x) ∂x1 ∂Φ2 (x) ∂x1 .. . ∂Φn (x) ∂x1 ∂Φ1 (x) ∂x2 ∂Φ2 (x) ∂x2 ··· ··· .. . .. . ∂Φn (x) · · · ∂x2 ∂Φ1 (x) ∂xm ∂Φ2 (x) ∂xm .. . ∂Φn (x) ∂xm . As before, we still use the dot to denote the derivative with respect to time. Lemma 1. Let F : Rn → Rn , G : Rm → Rm and Φ : Rm → Rn be continuously differentiable functions. The following two statements are equivalent: 1. For every β ∈ Rm , the (unique maximal) solutions a[Φ(β)] and b[β] of the initial value problems ȧ = F (a), a(0) = Φ(β), respectively ḃ = G(b), b(0) = β satisfy a[Φ(β)] = Φ ◦ b[β], i.e. a[Φ(β)](t) = Φ(b[β](t)) for all t ∈ dom(a[Φ(β)]) ∩ dom(b[β]). 2. F ◦ Φ = Φ0 · G, i.e. for all β ∈ Rm , F (Φ(β)) = Φ0 (β)G(β). Proof. 1 ⇒ 2: We start from a[Φ(β)] = Φ ◦ b[β]. We can differentiate the relation to get ȧ[Φ(β)] = Φ0 ◦ b[β] · ḃ[β]. But since a and b are the solutions of the corresponding differential equations, we can write F ◦ a[β] = Φ0 ◦ b[β] · G ◦ b[β]. Since the relation holds for all values of t, we can take t = 0 and use the fact that b[β](0) = β and a[Φ(β)](0) = Φ(β) to get F (Φ(β)) = Φ0 (β) · G(β), which is the desired result. 2 ⇒ 1: We start with F ◦ Φ = Φ0 · G. Let us see that in this case Φ ◦ b[β] satisfies a’s ODEs. Indeed, we have (Φ ◦ b[β])0 = Φ0 ◦ b[β] · ḃ[β] = Φ0 ◦b[β]·G◦b[β] = (Φ0 ·G)◦b[β] = F ◦Φ◦b[β], where we assume that function composition binds stronger than function products. We have obtained the desired condition from a’s ODEs. Note also that we have (Φ ◦ b[β])(0) = Φ(β). Thus, by Theorem 1, it must be that dom(b[β]) ⊆ dom(a[Φ(β)]) and a[Φ(β)](t) = Φ(b[β](t)), for all t ∈ dom(b[β]), which is the desired result. The result from Lemma 1 conveys the idea that in order for fit-preserving refinement to be possible, there should be a strong connection between the three functions F , G and Φ. Lemma 2. Let S and S 0 be two finite sets, of sizes m and m0 , respectively, 0 and let ρ ⊆ S × S 0 be a species refinement relation. Then, for any x ∈ RS ≥0 and c ∈ NS it holds that ! (Mρ x)c = X c0 ∈ρ(c) 10 c c0 x . c0 Proof. Without loss of generality, we can assign names to the elements of S , let them be S1 , . . . , Sm . Note that ρ induces a partition of the elements of S 0 into sets of subspecies, one for each of the species in S . Thus, we can use Sij0 to refer to the elements in ρ(Si ), with j ranging from 1 to ni = |ρ(Si )| P 0 and m i=1 ni = m . Furthermore, we use xij to refer to the element of x that corresponds to Sij0 . Recall that Mρ computes the sum of subspecies for each species, i.e.: Mρ x = n1 X x1j , j=1 n2 X x2j , . . . , j=1 nm X T xmj . j=1 Then we can write: c (Mρ x) = m Y ni X i=1 ci xij = j=1 ni m X Y Y ci ! c0ij xij , Qni 0 0 j=1 cij ! j=1 i=1 [c ,...,c0 ]T ∈Nni i1 ini Pni 0 c =ci j=1 ij where we have used multinomial expansion. Now, since for different values of i the corresponding c0ij ’s are independent, we can rewrite the result by swapping the sum and product as: c (Mρ x) = [c0i1 ,...,c0in ]T ∈Nni Pni ! ni m Y X Y ci ! c c0 c0 xijij = x , Qni 0 c0 j=1 cij ! i=1 i=1 j=1 0 S0 m Y X ! c ∈N Mρ c0 =c i c0 =ci j=1 ij 1≤i≤m which is the desired result. We are now ready to prove Theorem 4. of Theorem 4. In what follows we use C = {c ∈ NS | c → d ∈ R} and 0 C 0 = {c0 ∈ NS | c0 → d0 ∈ R 0 } to denote the source complexes of M and M 0 , respectively. In this case, since M 0 is a full ρ-refinement of M , we can S write C 0 = c∈C ρ(c). We start by noting that the fit-preservation condition from Definition 9 is an instantiation of statement 1 from Lemma 1 with variables a, b, x replaced by s, s0 , σ 0 , respectively, and: X F (s) = kc→d sc (d − c), c→d∈R 0 G(s ) = 0 Φ(σ ) X 0 kc0 →d0 s0c (d0 − c0 ), c0 →d0 ∈R 0 = Mρ σ 0 , Φ0 (σ 0 ) = Mρ . 11 Thus, we can write ∀σ 0 . s[Mρ σ 0 ] = Mρ s0 [σ 0 ] ⇐⇒ ∀σ 0 . F (Mρ σ 0 ) = Mρ G(σ 0 ) Next, in the latter equality we replace the left and right hand sides with the corresponding mass-action formulas and rely on Lemma 2 to obtain F (Mρ σ 0 ) = kc→d (Mρ σ 0 )c (d − c) X c→d∈R ! X = kc→d c 0 σ 0c (d − c) 0 c X c0 ∈ρ(c) c→d∈R ! X = X kc→d c∈C d∈NS c→d∈R X = σ X c0 ∈ρ(c) ! 0c0 ! c kc→d (d − c) c0 X d∈NS c=Mρ c0 c→d∈R c0 ∈C 0 c 0 σ 0c (d − c) 0 c and Mρ G(σ 0 ) = Mρ 0 kc0 →d0 σ 0c (d0 − c0 ) X c0 →d0 ∈R 0 X = X 0 kc0 →d0 σ 0c (Mρ d0 − Mρ c0 ) X c→d∈R c0 ∈ρ(c) d0 ∈ρ(d) X = c0 ∈C 0 σ 0c0 X X d∈NS kc0 →d0 (d − c) . d0 ∈ρ(d) c=Mρ c0 c→d∈R Moving both terms to the left hand side, we can conclude that M 0 is a 0 fit-preserving refinement of M if and only if, for all σ 0 ∈ NS : X c0 ∈C 0 σ 0c0 X d∈NS c=Mρ c0 c→d∈R ! X c kc→d − kc0 →d0 (d − c) = 0. 0 c d0 ∈ρ(d) Claim 1 of the theorem, as well as the ⇐ part of 2, follows directly from the equation above, since the condition for canonical refinement makes the difference in the parentheses vanish. For the ⇒ part of claim 2 we first use the fact that the sum on the left hand side is actually a multinomial where 0 σ 0c are monomials, so all the inner sums (as coefficients of the monomials) must vanish. Next, we deduce the condition for canonical refinement from the fact that the vectors d−c under the summation are linearly independent, by Theorem 2, since M has uniquely identifiable rate constants. 12 Note that we can now distinguish several versions of the actual notion of refinement. In the most general sense, we talk about any continuously differentiable mappings F and G and any continuously differentiable mapping Φ that defines the desired behavior when going from Rm to Rn . Already in this very general setting we can impose strong constraints on the three mappings, as shown in Lemma 1. Next, if we require that the mapping Φ corresponds to a species refinement, we end up using a very particular kind of linear mapping. The last specialization step consists in the use of mass-action formulas for F and G. These intermediary steps open up the opportunity to investigate in future work the more general versions of fit-preserving refinement. 4 Examples In this section we give examples of how canonical refinement can be used for obtaining fit-preserving refinements of mass-action reaction networks. In addition to the examples provided here, the reader may find our analysis of the Lotka-Volterra prey-predator model in [5]. First, to see why the result presented in [4] (and here as part 1 of Theorem 4) is only sufficient, consider the following mass-action reaction network: k 2A −−1→ A , k 2A −−2→ 3A . Such a model can, for example, characterize an ecological model describing the dynamics of some population A. The first reaction could encode the encounters that result in the death of the weaker individual, whereas the second reaction can stand for mating interactions. If we are to write the corresponding ODE for this network, we get: ȧ = (k2 − k1 )a Clearly the dynamics of this system is completely characterized by k2 −k1 and it is only this difference that can be fitted against experimental data, the actual values themselves cannot be determined uniquely. Based on Theorem 2 we expect that the reaction vectors for the unique source complex of this network are not linearly independent. Indeed, the (unidimensional) vectors are [−1] and [1] and are linearly dependent. Note that more intricate examples can also be constructed based on this idea. Consider now that we have two instances of this mass-action reaction network, say M and M 0 with rate constants k1 and k2 , respectively k10 and k20 , such that k2 − k1 = k20 − k10 but k1 6= k10 and k2 6= k20 . A canonical 13 refinement of M , call it Mr , is also a fit-preserving refinement of M , by Theorem 41. Moreover, since M and M 0 characterize the same dynamical behavior, it is also the case that Mr is a fit-preserving refinement of M 0 . On the other hand, Mr is not a canonical refinement for M 0 . An equivalent (with respect to the associated ODEs) mass-action reaction network can be constructed for this example, by keeping only one of k1 −k2 the two reactions. We can choose between 2A −− −→ A (the second reaction is dropped, but accounted for in the rate constant of the first reaction) or k2 −k1 2A −− −→ 3A (the first reaction dropped). Note that the actual choice matters, since it constrains the possible solutions (the kinetic rate constants need to be nonnegative). Thus, if we know that k1 ≥ k2 , we should use the first reaction, otherwise we must use the second one. In either case, we obtain a model that is equivalent to the original one (for the specific relation between k1 and k2 that is assumed) and, moreover, has uniquely identifiable rate constants. The second model that we are going to discuss is the Brusselator [9]. The usual formulation of the model is presented below. A −−→ X; 2X + Y −−→ 3X; B + X −−→ Y + D; X −−→ E. (6) All the rate constants are taken to be equal to 1 and, additionally, it is assumed that the concentrations of A and B are constant, either by being maintained from outside the system, or as an approximation, considering that the two concentrations are significantly larger than those of the other species in the model. The analysis of the Brusselator generally refers to the time evolution of X and Y , i.e. it is assumed that D and E are taken away from the system as they are produced. Note that, in the form presented above, this model is not a mass-action reaction network in the sense of Definition 4, because of the assumption that A and B are constant (rather than have their dynamical behavior follow a mass-action equation). The results from this paper can be generalized to cover this particular case as well by focusing on the associated ODEs. However, an important feature of our approach is the fact that canonical refinements (and implicitly fit-preserving refinements) can be computed based on the model representation alone (reactions and rate constants), without the need to write down the ODEs. Thus, in what follows we use a different formulation of the Brusselator, one that preserves the behavior of X and Y but imposes no constraints on the species, other than the mass-action law. It is not difficult to see that D and E can safely be removed from the 14 model, since they have no impact on the dynamics of X and Y . Furthermore, we aim to remove A and B as well, but account for the values of their (constant) concentrations in the new formulation so that X and Y preserve their behavior. For the original formulation (6), the associated ODEs are: ẋ = a + x2 y − bx − x, ẏ = bx − x2 y, where we have used lowercase letters to denote the corresponding concentration of each species. We can now rely on the fact that a and b are constant to obtain the same ODEs via mass-action kinetics for the following model: a r1 : ∅ −−→ X p r2 : 2X + Y −−→ 3X b r3 : X −−→ Y q r4 : X −−→ ∅ for p = q = 1. For this simplified version of the Brusselator, we consider the refinement of X into X1 and X2 . The full refinement of the reaction network is presented below: n a a r1 : ∅ −−1→ X1 , ∅ −−2→ X2 ; p00 2X1 + Y1 −− → 3X1 , p02 2X1 + Y1 −−→ X1 + 2X2 , 10 X + X + Y −p− → 3X , p 01 2X1 + Y1 −− → 2X1 + X2 , p03 2X1 + Y1 −−→ 3X2 , p11 X1 + X2 + Y1 −− → 2X1 + X2 , 1 2 1 1 r2 : p12 p13 X1 + X2 + Y1 −−→ X1 + 2X2 , X1 + X2 + Y1 −−→ 3X2 , p20 p21 2X2 + Y1 −− → 3X1 , 2X2 + Y1 −− → 2X1 + X2 , p22 p23 2X2 + Y1 −−→ X1 + 2X2 , 2X2 + Y1 −−→ 3X2 ; n b1 b r3 : X1 −− → Y1 , X2 −−2→ Y1 ; n q q r4 : X1 −−1→ ∅, X2 −−2→ ∅. We have grouped the reactions of the refined model by the corresponding reaction in the initial model. Furthermore, we used horizontal lines to separate the 12 refinements of r2 into groups that share the same left hand side. In order for the full refinement to be canonical, the following conditions must be satisfied: 15 Parameter Sum Case 1 Case 2 Case 3 Case 4 a1 , a2 a 3/4, 1/4 3/4, 1/4 3/4, 1/4 1, 0 p00 , p01 , p02 , p03 p 0, 1, 0, 0 1, 0, 0, 0 1, 0, 0, 0 1, 0, 0, 0 p10 , p11 , p12 , p13 2p 0, 0, 1, 0 0, 1/2, 1/2, 0 0, 1/3, 2/3, 0 0, 0, 1, 0 p20 , p21 , p22 , p23 p 0, 0, 0, 1 0, 0, 0, 1 0, 0, 0, 1 0, 0, 0, 1 b1 b 1 1 1 1 b2 b 1 1 1 1 q1 q 1 1 1 1 q2 q 1 1 1 1 x1 (0), x2 (0) x(0) 3/4, 1/4 3/4, 1/4 3/4, 1/4 0.99, 0.01 y1 (0) y(0) 1 1 1 1 Table 1: Parameter setup for each of the four fit-preserving refinements. a1 + a2 = a, p00 + p01 + p02 + p03 = p, p10 + p11 + p12 + p13 = 2p, p20 + p21 + p22 + p23 = p, b1 = b2 = b, q1 = q2 = q. Note that, for the canonical refinement, the values of b1 , b2 , q1 and q2 are already determined. For the other rate constants we can choose between different numerical setups, as long as the constraints are satisfied. In what follows we show that, while any canonical refinement guarantees that X1 +X2 and Y1 behave exactly the same as X and Y , respectively, the behaviors of X1 and X2 can be very different. For the original model we fixed the values a = p = q = 1 and b = 2.5. Furthermore, we considered the initial values x(0) = 2 and y(0) = 2. The resulting simulation is presented in Figure 1a. We consider four different fit-preserving refinements, corresponding to the parameter values indicated in Table 1. Note that the values are presented as the fraction of the overall sum enforced by canonical refinement. In addition to the chosen values for each case, we also provide possible assumptions that can lead to these values. The first three scenarios assume a 3 : 1 ratio of X1 to X2 in X, both for the initial values and for the X that is added to the system (the refinement of the first reaction). One possible interpretation of this setup is that the chemical element X has two isotopes X1 and X2 , and the naturally available X contains them in the chosen proportions. Furthermore, we read the third reaction of the Brusselator model 16 (a) Original model. (b) Case 1: Y1 can only be converted to X2 . (c) Case 2: Y1 can be converted to X1 as well as X2 , no preference. (d) Case 3: Y1 can be converted to X1 as well as X2 , preference for X2 . (e) Case 4: Conversion to X2 preferred, only X1 added from the outside. Figure 1: The Brusselator and its four proposed canonical, fit-preserving refinements. as follows: in the presence of two X molecules, a molecule of Y can be converted into X. The presented scenarios will extend this interpretation with assumptions about the difference between X1 and X2 with respect to this reaction. Case 1 In this scenario we assume that, in fact, Y1 can only be converted to X2 via the third reaction and we encode this assumption in the values chosen for the rate constants of the refined reactions, i.e., we take p01 = p, p12 = 2p, p23 = p and pij = 0 for all other refined reactions. Note that the ODE for X1 becomes: x˙1 = a − (b + q)x1 . Thus, the behavior of X1 only depends on the rate constants of the model and on the initial concentration of X1 . In particular, the steady state is given by x1 = a/(b + q). This is consistent with the plot from Figure 1b, where a/(b + q) = 0.25. Moreover, the concentration of X1 rapidly approaches the equilibrium value and the periodic behavior of X is given by X2 alone. 17 Case 2 We assume now that Y1 can be converted to X1 as well as X2 , and the actual outcome depends on the two X molecules that Y interacts with. In the presence of two molecules of X1 , Y1 will be converted to X1 , i.e. p00 = p and p01 = p02 = p03 = 0. Similarly, in the presence of two molecules of X2 , Y1 becomes X2 , i.e. p23 = p and p20 = p21 = p22 = 0. Last, but not least, when we have X1 and X2 interacting with Y1 , we assume an equal probability of obtaining X1 or X2 , i.e. p11 = p12 = p and p10 = p13 = 0. In this scenario we expect that both X1 and X2 exhibit similar behavior due to the symmetry of the refined reactions. Indeed, we can see in Figure 1c that the fractions of X1 and X2 in the total concentration of X remains the same throughout the simulation. To see also formally that this is the case, let us consider the ODEs for x1 and x2 in this case: ẋ1 = a1 + px21 y1 + px1 x2 y1 − (b + q)x1 , ẋ2 = a2 + px1 x2 y1 + px22 y1 − (b + q)x2 . Now, we use the fact that x1 + x2 = x (guaranteed by the construction of the fit-preserving refinement), to write x2 = x − x1 in the first ODE, then we divide both sides by a1 to get: x1 x1 ẋ1 = 1 + pxy1 − (b + q) . a1 a1 a1 Using the same approach for x2 , we obtain that x1 /a1 and x2 /a2 satisfy the same ODE. If we also assume that the initial values are the same, i.e. x1 (0)/a1 = x2 (0)/a2 (which is true for our scenario), then we have x1 /a1 = x2 /a2 . This can be rewritten as x1 /x2 = a1 /a2 and completes the proof of our claim. If the initial ratio x1 /x2 is not equal to a1 /a2 , then it will converge to this value. Case 3 Similarly to the previous scenario, we assume that Y1 is converted to X1 (respectively X2 ) when it interacts with two molecules of X1 (respectively X2 ). For the interaction with one X1 molecule and one X2 molecule, we assume that conversion to X2 is favored, i.e. we take p11 = 2p/3 and p12 = 4p/3. The result from Figure 1d shows that X1 and X2 no longer have the same proportion in X, and their peak values are no longer synchronized in time. Case 4 In this scenario we push the bias of X1 +X2 +Y1 towards producing only X1 + 2X2 . Furthermore, we change the assumptions for a1 and a2 so that only X1 is introduced in the system. Moreover, in the initial values, we take x1 (0) = 0.99x(0) and x2 (0) = 0.01x(0). With these assumptions, it can be seen in Figure 1e that, even though X2 starts with a very low 18 concentration, it does not disappear from the system, but again a periodic behavior is reached. Note also that the concentration of X2 periodically surpasses that of X1 , even though we only add X1 from outside the system. The four scenarios presented in this section are meant to show that fitpreserving refinement does not generate very restricted behaviors on the refined species, even though their sums are constrained by the dynamics of the original model. Furthermore, if experimental measurements are available for the refined species, the observed behavior can be compared with one of the cases presented here (possibly with other ones as well) so as to guide the model fitting process. We conclude this section with a discussion of rate constants identifiability for the Brusselator. The original model has a single reaction for each source complex (i.e. there are no two reactions that have the same left hand side) and, thus, it trivially satisfies the condition of Theorem 2. By Theorem 4 we can conclude that full refinements of the model are fit-preserving if and only if they are canonical. For the full refinement, however, we do have reactions with the same left hand side, namely refinements of r1 and r2 . We assume that complexes contain the stoichiometric coefficients in the order X1 , X2 , Y1 , and we compute the rank of the matrices containing the difference vectors as columns. The matrix for the refinements of r1 is 1 0 R1 = 0 1 0 0 and has rank 2, which means that the difference vectors are linearly independent. For the refinements of r2 we have three matrices, one for each source complex: R21 1 0 −1 −2 1 2 3 = 0 , −1 −1 −1 −1 R22 R23 2 1 0 −1 0 1 2 = −1 , −1 −1 −1 −1 3 2 1 0 −2 −1 0 1 = . −1 −1 −1 −1 All three matrices have rank 2 and, thus, if we wish to have uniquely identifiable rate constants, we can keep at most 2 reactions from each group that shares the same left hand side (i.e. the rate constants for the other reactions should be set to zero). Note that the four scenarios we have presented do satisfy this requirement. If we have numerical values for all four rate constants of the refined reactions, we can obtain an equivalent model with uniquely identifiable rate 19 constants, by choosing a suitable generator for the set of difference vectors, so that the rate constants of the reduced model are still positive. On the other hand, when trying to fit the model we do not know the rate constants, so a suitable approach would be to consider (as initialization) one instance for each set of linearly independent refined reactions and then choose the best fit. 5 Conclusions In this paper we improved our previous result from [4] by providing a complete characterization of fit-preserving refinement via a constraint relating the parameters of the refined model to those of the original model. The characterization applies as long as the original model has uniquely identifiable rate constants, a requirement that is critical for fitting the model [1]. Whenever applying data refinement to such models, we can deduce that every numerical setup that is fit-preserving corresponds to a canonical quantitative refinement. This means that we can initialize parameter estimation routines with these values and, thus, ensure that the resulting model fits the original data at least as well as the initial model did. Furthermore, we have seen in Section 4 that the behavior of refined species varies greatly with the chosen rate constants, even when the refined model is constrained to exhibit the same dynamics with respect to the species of the original model. The full refinement of a model does not have, in general, uniquely identifiable rate constants. Thus, we need to choose, from each set of refined reactions that share the same left hand side, a subset of linearly independent reactions. Considering all possible such refinements and simulating them, we can identify different classes for the behavior of the refined species and use them for guiding the model fitting process when experimental data is available. The very simple constraints used for the canonical refinement rely on the interplay between multinomial expansion and the formulation of mass-action dynamics. It would be interesting to investigate fit-preserving refinement for other kinetic models as well. Furthermore, the proof of our result reveals several versions of refinement, ranging from very general refinement of ODE models where the functions constraining the dynamics are not specified, to the very specific case of mass-action and the additive constraint on the behavior of refined species. These intermediary versions can provide additional insight for building models via refinement. An interesting case to consider in future work is the refinement of massaction reaction networks where the chemical structure of species is known, i.e., there is a distinction between atomic and complex species. In such cases, one can consider species refinement only for atomic species, then use it to 20 construct the refinement of complex species (see, e.g. [2]). The structural information can be used in this case to impose additional constraints on the rate constants of the refined reactions, e.g., by specifying mass conservation relations. References [1] Gheorghe Craciun and Casian Pantea. Identifiability of chemical reaction networks. Journal of Mathematical Chemistry, 44(1):244–259, 2008. [2] Elena Czeizler, Eugen Czeizler, Bogdan Iancu, and Ion Petre. Quantitative model refinement as a solution to the combinatorial size explosion of biomodels. Electronic Notes in Theoretical Computer Science, 284:35–53, 2012. [3] Eugen Czeizler, Vladimir Rogojin, and Ion Petre. The phosphorylation of the heat shock factor as a modulator for the heat shock response. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 9(5):1326–1337, 2012. [4] C. Gratie and I. Petre. Fit-preserving data refinement of mass-action reaction networks. In Language, Life, Limits, pages 204–213. Springer, 2014. [5] Cristian Gratie and Ion Petre. Fit-preserving data refinement of massaction reaction networks. Technical report, TUCS, 2014. [6] Morris William Hirsch, Stephen Smale, and Robert Luke Devaney. Differential equations, dynamical systems and an introduction to chaos, volume 60 of Pure and Applied Mathematics. Academic Press, 2 edition, 2004. [7] Edda Klipp, Ralf Herwig, Axel Kowald, Christoph Wierling, and Hans Lehrach. Systems Biology in Practice. WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim, 2005. [8] David L. Nelson and Michael M. Cox. Lehninger Principles of Biochemistry. WH Freeman, 2004. [9] Grégoire Nicolis and Ilya Prigogine. Self-Organizationin Nonequilibrium Systems: From Dissipative Structures to Order Through Fluctuations. Wiley, 1977. 21 Joukahaisenkatu 3-5 A, 20520 TURKU, Finland | www.tucs.fi University of Turku Faculty of Mathematics and Natural Sciences • Department of Information Technology • Department of Mathematics Turku School of Economics • Institute of Information Systems Sciences Åbo Akademi University • Department of Computer Science • Institute for Advanced Management Systems Research ISBN 978-952-12-3172-8 ISSN 1239-1891
© Copyright 2026 Paperzz