Exploring the gap between dynamic and constraint-based models of metabolism

doi:10.1016/j.ymben.2012.01.003

Metabolic Engineering

Volume 14, Issue 2, March 2012, Pages 112-119

https://doi.org/10.1016/j.ymben.2012.01.003 Get rights and content

Abstract

Systems biology provides new approaches for metabolic engineering through the development of models and methods for simulation and optimization of microbial metabolism. Here we explore the relationship between two modeling frameworks in common use namely, dynamic models with kinetic rate laws and constraint-based flux models. We compare and analyze dynamic and constraint-based formulations of the same model of the central carbon metabolism of Escherichia coli. Our results show that, if unconstrained, the space of steady states described by both formulations is the same. However, the imposition of parameter-range constraints can be mapped into kinetically feasible regions of the solution space for the dynamic formulation that is not readily transferable to the constraint-based formulation. Therefore, with partial kinetic parameter knowledge, dynamic models can be used to generate constraints that reduce the solution space below that identified by constraint-based models, eliminating infeasible solutions and increasing the accuracy of simulation and optimization methods.

Highlights

► Constraint-based and full kinetic models share the same set of steady states. ► Partial rate-constant knowledge can further constrain kinetic models. ► Such reductions can be transferred from kinetic to constraint-based formulations.

Introduction

The prevalence of systems approaches to biological problems has renewed interest in mathematical models as fundamental research tools for performing in silico experiments of biological systems (Kitano, 2002). In the context of metabolic engineering, models of metabolism play an important role in the simulation of cellular behavior under different genetic and environmental conditions (Stephanopoulos, 1998). Typical experiments include knockout simulations to study how metabolic flux distributions readjust throughout a given network. With the selection of an optimal set of knockouts or changes in enzyme expression levels, it is desirable to optimize the production of compounds of industrial interest (Burgard et al., 2003, Patil et al., 2005).

Systems of ordinary differential equations (ODEs) have been applied in different areas to model dynamical systems. In the context of metabolic networks, they describe the rate of change of metabolite concentrations. These dynamic models contain rate law equations for the reactions as well as their kinetic parameters and initial metabolite concentrations. Building this type of model requires insight into enzyme mechanism to select appropriate rate laws, as well as experimental data for parameter estimation. Therefore, their application has been more limited, but areas of application include central metabolic pathways of well-studied organisms such as Escherichia coli (Chassagnole et al., 2002) and Saccharomyces cerevisiae (Rizzi et al., 1997). There are, however, some recent efforts to overcome these limitations in the reconstruction of large-scale dynamic models, such as through the hybrid dynamic/static approach (Yugi et al., 2005), the ensemble modeling approach (Tran et al., 2008), and the application of approximate kinetic formats using stoichiometric models as a scaffold (Smallbone et al., 2010, Jamshidi and Palsson, 2010). Nevertheless, these techniques have so far been applied to very few organisms.

On the other hand, advances in genome sequencing have facilitated the reconstruction of genome-scale metabolic networks for several organisms, with over 50 reconstructions available to date (Oberhardt et al., 2011). Due to the lack of kinetic data at the genome scale, this type of model only accounts for reaction stoichiometry and reversibility. Analysis is performed under the assumption of steady state using a constraint-based formulation that is underdetermined, resulting in a continuous space of solutions for the reaction flux distributions. This uncertainty of the flux distributions requires additional conditions to determine unique solutions and predictions. Often this takes the form of an optimization based on a particular assumption, such as optimal biomass growth for wild-type (Edwards and Palsson, 2000) and minimization of cellular adjustments for knock-out strains (Segrè et al., 2002, Shlomi et al., 2005). The inclusion of regulatory constraints, introduced by Covert and Palsson (2003), is a current approach to reduce the size of the solution space and eliminate infeasible solutions. One limitation of the constraint-based approach is the inability to express transient behavior. In order to simulate fermentation profiles, a few methods have been developed to integrate the variation of external concentrations while assuming an internal pseudo-steady state (Oddone et al., 2009, Leighty and Antoniewicz, 2011).

The two most common model types in use, therefore, represent two extremes. The dynamic ODE formulation contains detailed mechanistic information that gives solutions of the transient dynamic approach to equilibrium from any given set of initial conditions (generally concentrations of enzymes and metabolites), as well as the steady state specified by metabolite concentrations that depend on total enzyme concentrations (for the usual case where they are treated as fixed) but often do not depend on the initial metabolite concentrations. Steady-state fluxes are readily computed from the steady-state concentrations and the rate laws. The constraint-based formulation seems minimalist by comparison: it has no mechanistic knowledge of any of the chemical reactions beyond their stoichiometry, its solutions have fluxes at steady state but no information regarding concentrations or dynamics, and rather than giving a unique solution, it produces a high-dimensional continuum of steady-state solutions (referred to as a flux cone). The dynamic formulation needs significant information (parameters in term of rate constants and total enzyme concentrations, as well as reaction mechanisms to give rate laws), but generally rewards that effort with unique and detailed solutions. The constraint-based formulation requires less (no parameters except maximum fluxes) but delivers less.

Because of these significant differences between dynamic and constraint-based formulations, they treat the effects of network perturbations, which might be undertaken as part of a metabolic engineering study, very differently. A dynamic formulation will make very specific predictions about the response to a gene knockout, for example, but generally such models lack information about gene regulatory changes that accompany metabolic changes, and so without foreknowledge to adjust relative enzyme concentrations, such predictions can be significantly in error. Constraint-based formulations can access all possible steady-state solutions but can only rely on relatively simple heuristics to select among them, and are uncertain how to include specific information on gene regulatory changes.

Here we explore further the relationship between these formulations by essentially considering the continuous ensemble of dynamic formulations obtained by varying parameters (principally rate constants and enzyme concentrations) and compare the steady-state solutions to those from the corresponding constraint-based formulation. We find an equivalence between the sets of steady states when only maximum flux constraints are present, but that more specific constraints and enzyme concentrations can be directly incorporated to define a reduced dynamic ensemble that is significantly more informative regarding possible steady-state solutions than the constraint-based formulation.

Section snippets

Models

We have used a dynamic model of the central carbon metabolism of E. coli (Chassagnole et al., 2002) available at the Biomodels database (Le Novere et al., 2006). The model was converted from its original SBML format into a MATLAB (The Mathworks; Natick, MA, USA) file that was used for all computations in this work. The model consists of a total of 18 metabolites and 31 reactions, including several enzymatic reactions, one exchange reaction, and a few lumped versions of biosynthetic pathways.

Results

In order to explore the gap between both types of formulations, we analyzed and compared the dynamic and constraint-based formulations of the same model of the central carbon metabolism of E. coli (Chassagnole et al., 2002) (see Methods for model formulation).

Our goal is to compare the steady states achievable by the two model types. Intuitively the dynamic formulation has more constraints than the constraint-based one because the later only enforces the steady-state condition and maximum flux

Discussion

We have analyzed and compared dynamic and constraint-based formulations of the same model for the central carbon metabolism of E. coli (Chassagnole et al., 2002). The constraint-based version does not account for metabolite concentrations, and it does not express transient behavior. Therefore, the formulations can only be compared in their common domain, which is the steady-state flux distribution.

The constraint-based model defines a solution space for the steady-state flux distribution (called

Conclusions

In this work we have explored the solution spaces of both dynamic and constraint-based models in order to bring together top-down and bottom-up approaches, and we have proposed methods of treating each as well as their interrelation.

Dynamic model reconstruction is a bottom-up approach for iteratively building large-scale metabolic pathways with kinetic detail. Due to a lack of experimental data, differences in experimental conditions, and measurement uncertainty, the kinetic parameters are

Acknowledgments

This research was supported by PhD Grants SFRH/BD/35215/2007 and SFRH/BD/25506/2005 from the Fundação para a Ciência e a Tecnologia (FCT) and the MIT–Portugal Program through the project “Bridging Systems and Synthetic Biology for the Development of Improved Microbial Cell Factories” (MIT-Pt/BS-BB/0082/2008).

References (32)

D. Beard et al.
Energy balance for analysis of complex metabolic networks
Biophys. J.
(2002)
M. Covert et al.
Constraints-based models: regulation of gene expression reduces the steady-state solution space
J. Theor. Biol.
(2003)
N. Jamshidi et al.
Mass action stoichiometric simulation models: incorporating kinetics and regulation into stoichiometric models
Biophys. J.
(2010)
R. Leighty et al.
Dynamic metabolic flux analysis (DMFA): a framework for determining fluxes at metabolic non-steady state
Metab. Eng.
(2011)
G. Oddone et al.
A dynamic, genome-scale flux model of Lactococcus lactis to increase specific recombinant protein expression
Metab. Eng.
(2009)
L. Tran et al.
Ensemble modeling of metabolic networks
Biophys. J.
(2008)
S. Wiback et al.
Monte Carlo sampling can be used to determine the size and shape of the steady-state flux space
J. Theor. Biol.
(2004)
W. Wiechert
13C metabolic flux analysis
Metab. Eng.
(2001)
A. Burgard et al.
OptKnock: a bilevel programming framework for identifying gene knockout strategies for microbial strain optimization
Biotechnol. Bioeng.
(2003)
C. Chassagnole et al.
Dynamic modeling of the central carbon metabolism of Escherichia coli
Biotechnol. Bioeng.
(2002)

J. Edwards et al.

Metabolic flux balance analysis and the in silico analysis of Escherichia coli K-12 gene deletions

BMC Bioinformatics

(2000)

E. Gianchandani et al.

Matrix formalism to describe functional states of transcriptional regulatory systems

PLoS Comput. Biol.

(2006)

A. Hoppe et al.

Including metabolite concentrations into flux balance analysis: thermodynamic realizability as a constraint on flux distributions in metabolic networks

BMC Syst. Biol.

(2007)

R. Ibarra et al.

Escherichia coli K-12 undergoes adaptive evolution to achieve in silico predicted optimal growth

Nature

(2002)

H. Kitano

Systems biology: a brief overview

Science

(2002)

N. Le Novere et al.

BioModels database: a free, centralized database of curated, published, quantitative kinetic models of biochemical and cellular systems

Nucleic Acids Res.

(2006)

Cited by (26)

Digital models in biotechnology: Towards multi-scale integration and implementation
2022, Biotechnology Advances
Citation Excerpt :
Kinetic modelling aims to describe the cell at the molecular level more precisely. By incorporating enzyme kinetics, insights regarding fractional states of enzymes and regulation mechanisms at a network level can be provided (Haiman et al., 2021; Machado et al., 2012; Yasemi and Jolicoeur, 2021). Kinetic modelling is a powerful modelling framework in industrial biotechnology for both applications in metabolic engineering as well as process engineering and thus was successfully applied for process design (Bettenbrock et al., 2006; Chu and Constantinides, 1988; Kadir et al., 2010; Kotte et al., 2010; Moisset et al., 2012; Nolan and Lee, 2011; Oshiro et al., 2009; Sansonetti et al., 2011; Shinto et al., 2007; Usuda et al., 2010; Wang et al., 2014), strain engineering (Andreozzi et al., 2016; Khodayari and Maranas, 2016; Savoglidis et al., 2016), identification of drug targets (Bordbar et al., 2015; Haanstra et al., 2017; Murabito et al., 2011) or unravelling key regulatory interactions (Link et al., 2013; Saa and Nielsen, 2017, 2016).
Industrial biotechnology encompasses a large area of multi-scale and multi-disciplinary research activities. With the recent megatrend of digitalization sweeping across all industries, there is an increased focus in the biotechnology industry on developing, integrating and applying digital models to improve all aspects of industrial biotechnology. Given the rapid development of this field, we systematically classify the state-of-art modelling concepts applied at different scales in industrial biotechnology and critically discuss their current usage, advantages and limitations. Further, we critically analyzed current strategies to couple cell models with computational fluid dynamics to study the performance of industrial microorganisms in large-scale bioprocesses, which is of crucial importance for the bio-based production industries. One of the most challenging aspects in this context is gathering intracellular data under industrially relevant conditions. Towards comprehensive models, we discuss how different scale-down concepts combined with appropriate analytical tools can capture intracellular states of single cells. We finally illustrated how the efforts could be used to develop digitals models suitable for both cell factory design and process optimization at industrial scales in the future.
Misinterpretation risks of global stochastic optimisation of kinetic models revealed by multiple optimisation runs
2019, Mathematical Biosciences
One of use cases for metabolic network optimisation of biotechnologically applied microorganisms is the in silico design of new strains with an improved distribution of metabolic fluxes. Global stochastic optimisation methods (genetic algorithms, evolutionary programing, particle swarm and others) can optimise complicated nonlinear kinetic models and are friendly for unexperienced user: they can return optimisation results with default method settings (population size, number of generations and others) and without adaptation of the model. Drawbacks of these methods (stochastic behaviour, undefined duration of optimisation, possible stagnation and no guaranty of reaching optima) cause optimisation result misinterpretation risks considering the very diverse educational background of the systems biology and synthetic biology research community. Different methods implemented in the COPASI software package are tested in this study to determine their ability to find feasible solutions and assess the convergence speed to the best value of the objective function. Special attention is paid to the potential misinterpretation of results.
Optimisation methods are tested with additional constraints that can be introduced to ensure the biological feasibility of the resulting optimised design: (1) total enzyme activity constraint (called also amino acid pool constraint) to limit the sum of enzyme concentrations and (2) homeostatic constraint limiting steady state metabolite concentration corridor around the steady state concentrations of metabolites in the original model. Impact of additional constraints on the performance of optimisation methods and misinterpretation risks is analysed.
In Silico Prediction of Large-Scale Microbial Production Performance: Constraints for Getting Proper Data-Driven Models
2018, Computational and Structural Biotechnology Journal
Citation Excerpt :
Altogether, these interactions form a very complex regulatory network. Roughly, the plentitude of GRNs may be divided into three representative approaches: continuous models (in this case based on ordinary differential equations) [71, 86–91], Boolean models [92–95] and probabilistic models [96–99]. These and other methods, such as Petri nets, Bayesian networks or neural networks, have been extensively reviewed by Karlebach and Shamir [100] and Machado et al. [101].
Industrial bioreactors range from 10.000 to 700.000 L and characteristically show different zones of substrate availabilities, dissolved gas concentrations and pH values reflecting physical, technical and economic constraints of scale-up. Microbial producers are fluctuating inside the bioreactors thereby experiencing frequently changing micro-environmental conditions. The external stimuli induce responses on microbial metabolism and on transcriptional regulation programs. Both may deteriorate the expected microbial production performance in large scale compared to expectations deduced from ideal, well-mixed lab-scale conditions. Accordingly, predictive tools are needed to quantify large-scale impacts considering bioreactor heterogeneities. The review shows that the time is right to combine simulations of microbial kinetics with calculations of large-scale environmental conditions to predict the bioreactor performance. Accordingly, basic experimental procedures and computational tools are presented to derive proper microbial models and hydrodynamic conditions, and to link both for bioreactor modeling. Particular emphasis is laid on the identification of gene regulatory networks as the implementation of such models will surely gain momentum in future studies.
Kinetic modeling of cell metabolism for microbial production
2016, Journal of Biotechnology
Citation Excerpt :
Constrain-based models ignore reaction kinetics, therefore efforts to integrate kinetic expressions into constrain-based models were proposed, which allow reducing the gap between the stoichiometric constraint-based and kinetic modeling. In Machado et al. (2012) randomly generated parameter samples were used to create a set of steady-state solution for a central carbon metabolism model of E. coli. The steady-state solution of the kinetic model can be mapped into the flux bounds of the constraint-based model, restricting its solution space.
Kinetic models of cellular metabolism are important tools for the rational design of metabolic engineering strategies and to explain properties of complex biological systems. The recent developments in high-throughput experimental data are leading to new computational approaches for building kinetic models of metabolism.
Herein, we briefly survey the available databases, standards and software tools that can be applied for kinetic models of metabolism. In addition, we give an overview about recently developed ordinary differential equations (ODE)-based kinetic models of metabolism and some of the main applications of such models are illustrated in guiding metabolic engineering design. Finally, we review the kinetic modeling approaches of large-scale networks that are emerging, discussing their main advantages, challenges and limitations.
A Comparative Analysis of Dynamic Models of the Central Carbon Metabolism of Escherichia coli
2016, IFAC-PapersOnLine
Dynamic models of metabolism have been developed for a variety of systems and can be applied in metabolic engineering design and to understand the time-varying characteristics of the systems when exposed to different stimuli. Hereby we analyse and compare the most used and complete kinetic models available for the central carbon metabolism of E. coli. Stoichiometric and kinetic comparisons showed several differences, discrepancies and incoherence especially regarding the kinetic mechanisms assumed, parameters and units. Time course and steady-state simulations and also comparison with an experimental dataset put in evidence major differences regarding responses to the same stimulus. The results presented raise important questions regarding the need of using standard methodologies in dynamic model construction as well as in using experimental data for model validation.
MCR-ALS on metabolic networks: Obtaining more meaningful pathways
2015, Chemometrics and Intelligent Laboratory Systems
With the aim of understanding the flux distributions across a metabolic network, i.e. within living cells, Principal Component Analysis (PCA) has been proposed to obtain a set of orthogonal components (pathways) capturing most of the variance in the flux data. The problems with this method are (i) that no additional information can be included in the model, and (ii) that orthogonality imposes a hard constraint, not always reasonably. To overcome these drawbacks, here we propose to use a more flexible approach such as Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS) to obtain this set of biological pathways through the network. By using this method, different constraints can be included in the model, and the same source of variability can be present in different pathways, which is reasonable from a biological standpoint. This work follows a methodology developed for Pichia pastoris cultures grown on different carbon sources, lately presented in González-Martínez et al. (2014). In this paper a different grey modelling approach, which aims to incorporate a priori knowledge through constraints on the modelling algorithms, is applied to the same case of study. The results of both models are compared to show their strengths and weaknesses.

View all citing articles on Scopus

View full text

Exploring the gap between dynamic and constraint-based models of metabolism

Abstract

Highlights

Introduction

Section snippets

Models

Results

Discussion

Conclusions

Acknowledgments

Biophys. J.

J. Theor. Biol.

Biophys. J.

Metab. Eng.

Metab. Eng.

Biophys. J.

J. Theor. Biol.

Metab. Eng.

OptKnock: a bilevel programming framework for identifying gene knockout strategies for microbial strain optimization

Biotechnol. Bioeng.

Dynamic modeling of the central carbon metabolism of Escherichia coli

Biotechnol. Bioeng.

Metabolic flux balance analysis and the in silico analysis of Escherichia coli K-12 gene deletions

BMC Bioinformatics

Matrix formalism to describe functional states of transcriptional regulatory systems

PLoS Comput. Biol.

Including metabolite concentrations into flux balance analysis: thermodynamic realizability as a constraint on flux distributions in metabolic networks

BMC Syst. Biol.

Escherichia coli K-12 undergoes adaptive evolution to achieve in silico predicted optimal growth

Nature

Systems biology: a brief overview

Science

BioModels database: a free, centralized database of curated, published, quantitative kinetic models of biochemical and cellular systems

Nucleic Acids Res.