Addressing the challenges of multiscale model management in systems biology

doi:10.1016/j.compchemeng.2006.10.004

Computers & Chemical Engineering

Volume 31, Issue 8, 15 August 2007, Pages 962-979

Abstract

Mathematical and computational modelling are emerging as important techniques for studying the behaviour of complex biological systems. We argue that two advances are necessary to properly leverage these techniques: first, the ability to integrate models developed and executed on separate tools without the need for substantial translation; and second, a comprehensive system for storing and managing not only the models themselves but also the parameters and tools used to execute those models and the results they produce. A framework for modelling with these features is described here. We have developed a suite of XML-based services for storing and analysing models, model parameters and results, together with tools for model integration. We present these here and evaluate their effectiveness using a worked example based on part of the hepatocyte glycogenolysis system.

Introduction

Modelling physiology is in many ways similar to modelling process systems, so there is much that chemical engineers can contribute. As with process systems, one of the major challenges in computational physiology is to integrate efficiently the existing computational models that describe phenomena across a variety of spatial and temporal scales. Such models can be deterministic, stochastic, qualitative, or take many other forms. An important part of this challenge is the storage, collation, and retrieval of models, along with their integration.

Our work (The UCL Beacon Project, 2002–2007) is part of the UK Department of Trade and Industry sponsored Beacon program, focused on harnessing genomics. We aim to build in-silico models that represent aspects of behaviour of the human liver, an epithelial organ. The methodology and modelling system should then be extendable to other epithelial organs. In building a fully integrated model of the liver, existing models of various components must be used along with newly devised models. Our approach is therefore to develop a system for the orchestration and integration of models. Not only will this system permit the development of integrated models which could not otherwise be constructed, it will also support the development of these models in a manner which increases the computational efficiency and reliability of those models, and reduces the time taken for such development.

The framework we have developed supports two key aspects of biological modelling: model integration across different scales, and the interconnection of the distinct components in biological systems. Interconnections are largely based on signalling, i.e. the transport and reaction of chemicals between distinct components that drive the physiological system. Using this framework, the project aims to develop a simulation environment in which a wide variety of models are integrated and exploited within a common domain of interest. These models may be at different levels of abstraction, may deploy different representations, and may focus on different interacting phenomena. Validation may give rise to model variants that will require management.

Our project will result in a system to integrate models addressing phenomena from the level of individual gene and cell features through tissue and organ models. Models at every level of the structure will be integrated, validated, and exploited using a plethora of mathematical, computational and experimental techniques. Fig. 1 shows the hierarchy of levels of signalling activity in many physiological systems.

One of the fundamental issues in model integration is how to handle the intrinsic inter-relationships between different models in an efficient way. Individual models are built up in an isolated biological environment relative to the real physiology and the purpose of linking different models is to recover the physiological conditions in terms of the context the models cover. Our computational framework for linking biological models will take account of the intrinsic couplings existing among the models, while allowing the flexibility that comes from being able to ‘plug’ in different choices of model, and link models which take different approaches to modelling, or which apply to different scales of consideration.

In this paper, we review existing work on computational infrastructure for systems biology, argue that two areas of software engineering (information management and encapsulation) should in particular be brought to bear upon the problem, and describe a series of software modules we have authored that together constitute a complete computational environment for systems biology. In particular, the system supports the integration of models built in very different software environments while leaving the authoring and execution of the component models within those environments. We provide evidence for the effectiveness of our technique using an example model of part of the response of the liver to adrenaline, where one of the component models is built in Mathematica and another in X-Phase-Plane-Auto (XPPAUT).

Section snippets

The state of the art

Much current modelling work in biology does not take into account the potential plethora of different models nor how to ‘orchestrate’ them. Integration mechanisms are at the program code level. A good example is the work on the heart carried out by Denis Noble and his team (Noble, 2002). Other groups are also attempting to take a more considered approach to model integration, and we review some related work here.

Metamodelling

In order to understand biological modelling, we have modelled the elements involved in model construction and validation, thus elucidating a biological metamodel. This comprehensive “metamodel” (Finkelstein et al., 2004) underpins the development of the tools presented in this paper, so it is reviewed here.

The metamodel representation developed by the project, shown in Fig. 2, uses an ‘entity-relationship’ (ER) modelling approach (first presented in Chen (1976)) and presents an entity class (of

Modularity

We construct biological models by connecting together existing smaller models of individual phenomena. This approach has many advantages – if the component models are well understood and have been individually well-tested then much of this confidence should carry over to the larger model. It also has disadvantages – there may be subtle incompatibilities between models which invalidate their integration. Our approach to building software to support model integration has been to try to leverage

The need for information management

Another important and well-established software engineering paradigm concerns the careful management of the information pertaining to an endeavour: the field of information management. At the moment, there is little standard practice in how data are recorded for use in biological modelling. Parameters are collected from the literature and recorded in an ad hoc fashion using notebooks or small-scale computing solutions. The tools used to execute models are installed and configured in many
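As an illustration of what managed parameter storage could look like, the following minimal sketch records a parameter together with its units and literature source as XML, then reads it back. The element names (`parameterDatabase`, `parameter`, `value`, `source`), the helper functions, and the example values are all hypothetical; they are not the schema or the data used by the framework described in this paper.

```python
import xml.etree.ElementTree as ET


def make_parameter(name, value, units, source):
    """Build one parameter record (hypothetical element and attribute names)."""
    elem = ET.Element("parameter", {"name": name, "units": units})
    ET.SubElement(elem, "value").text = repr(value)
    ET.SubElement(elem, "source").text = source
    return elem


def lookup(db, name):
    """Return (value, units, source) for the named parameter."""
    for elem in db.findall("parameter"):
        if elem.get("name") == name:
            return (float(elem.findtext("value")),
                    elem.get("units"),
                    elem.findtext("source"))
    raise KeyError(name)


# Illustrative record only: the name, value, and source are placeholders.
db = ET.Element("parameterDatabase")
db.append(make_parameter("k_example", 0.5, "1/s", "placeholder citation"))

# Serialise to text and parse again, as a stored file would be.
xml_text = ET.tostring(db, encoding="unicode")
restored = ET.fromstring(xml_text)
value, units, source = lookup(restored, "k_example")
```

Keeping the source next to the value means that a result can later be traced back to the literature values that produced it, which is the kind of provenance that ad hoc notebooks tend to lose.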

Integration framework

Fig. 3 shows an overview of our model integration framework, intended to facilitate a modular approach to systems biology modelling, with an emphasis on information management. Note that in Fig. 3, there are only two models. This is a simplified view, appropriate to the example model used later in this paper, see Appendix A.1. A composite model can possess much more complex topology consisting of many models and connectors—our framework has been used to support a seven-element composite model,
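The idea of a composite model built from wrapped models and connectors can be sketched as follows. This is a toy under stated assumptions, not the framework's implementation: the class names `ModelWrapper` and `Connector`, the function `run_composite`, the fixed-step lockstep exchange, the explicit Euler updates, and the two toy equations are all invented for illustration. In the actual system the wrapped models run in external tools.

```python
from dataclasses import dataclass


class ModelWrapper:
    """Uniform interface that hides how a model is actually executed (hypothetical)."""

    def __init__(self, name, step_fn, state):
        self.name = name
        self._step_fn = step_fn
        self.state = dict(state)
        self.inputs = {}

    def set_input(self, key, value):
        self.inputs[key] = value

    def get_output(self, key):
        return self.state[key]

    def advance(self, dt):
        self.state = self._step_fn(self.state, self.inputs, dt)


@dataclass
class Connector:
    """Routes one output of a source model to one input of a target model."""
    source: str
    output: str
    target: str
    input: str


def run_composite(models, connectors, dt, n_steps):
    """Exchange values through the connectors, then advance every model by dt."""
    by_name = {m.name: m for m in models}
    for _ in range(n_steps):
        for c in connectors:
            value = by_name[c.source].get_output(c.output)
            by_name[c.target].set_input(c.input, value)
        for m in models:
            m.advance(dt)
    return {m.name: dict(m.state) for m in models}


# Toy dynamics (assumptions): x' = -x upstream, y' = u - y downstream.
def upstream_step(state, inputs, dt):
    return {"x": state["x"] - state["x"] * dt}


def downstream_step(state, inputs, dt):
    u = inputs.get("u", 0.0)
    return {"y": state["y"] + (u - state["y"]) * dt}


models = [ModelWrapper("upstream", upstream_step, {"x": 1.0}),
          ModelWrapper("downstream", downstream_step, {"y": 0.0})]
connectors = [Connector("upstream", "x", "downstream", "u")]
result = run_composite(models, connectors, dt=0.5, n_steps=2)
```

Because each model is reached only through `set_input`, `get_output`, and `advance`, either toy model could be swapped for a different implementation without touching the other or the connector list.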

Run-time information flow

The user launches the model run manager (1) and points it at a composite model definition file (2). The user also chooses parameters, and the MRM builds from them a parameter run file (3) pointing to values in the parameter database (4). The MRM launches an orchestrator (5), which uses the CMDL file (6), to find (7) metadata files for the individual models, and, from them (8) the model definition files. It then instantiates (9) models and their engines, based on (10) those definition files, and
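The numbered steps above can be traced in a small sketch. Everything here is a stand-in: plain dictionaries replace the CMDL file, the metadata and model definition files, and the parameter database, and the names `model_run_manager`, `build_parameter_run`, `orchestrate`, `ModelInstance`, `modelA`, `engineA`, and so on are hypothetical. The numbers in the comments refer to the steps in the text.

```python
from dataclasses import dataclass

# In-memory stand-ins for the files and database named in the text.
PARAMETER_DB = {"p1": 2.0, "p2": 3.0}                 # parameter database (4)
CMDL = {"models": ["modelA", "modelB"]}               # composite model definition (2), (6)
METADATA = {                                          # per-model metadata files (7)
    "modelA": {"definition": "modelA.def", "engine": "engineA"},
    "modelB": {"definition": "modelB.def", "engine": "engineB"},
}
DEFINITIONS = {                                       # model definition files (8), (10)
    "modelA.def": {"parameters": ["p1"]},
    "modelB.def": {"parameters": ["p2"]},
}


@dataclass
class ModelInstance:
    name: str
    engine: str
    parameters: dict


def build_parameter_run(chosen):
    """Step (3): record references into the database rather than copies of values."""
    missing = [key for key in chosen if key not in PARAMETER_DB]
    if missing:
        raise KeyError(f"not in parameter database: {missing}")
    return {"refs": list(chosen)}


def orchestrate(cmdl, parameter_run):
    """Steps (5) to (10): resolve metadata and definitions, then instantiate models."""
    instances = []
    for name in cmdl["models"]:
        metadata = METADATA[name]
        definition = DEFINITIONS[metadata["definition"]]
        values = {key: PARAMETER_DB[key]
                  for key in definition["parameters"]
                  if key in parameter_run["refs"]}
        instances.append(ModelInstance(name, metadata["engine"], values))
    return instances


def model_run_manager(cmdl, chosen):
    """Step (1): entry point that builds the parameter run and launches orchestration."""
    return orchestrate(cmdl, build_parameter_run(chosen))


instances = model_run_manager(CMDL, ["p1", "p2"])
```

The sketch keeps the indirection described in the text: the composite definition names models, the metadata points to definitions and engines, and parameter values are resolved from the database only at instantiation.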

An example system

The system we have chosen to use to illustrate and test our techniques is based on existing models of hormone-stimulated hepatocyte glycogenolysis. This important physiological process is the means by which energy, in the form of glucose, is released from storage in the liver in humans and other animals. It constitutes one part of the glucose homeostasis system by which blood sugar levels are maintained within acceptable limits. Fig. 6 shows a cartoon of the main features of the pathway that

Conclusions

We have presented a model integration framework for systems biology, with an architecture based on an orchestrator, wrappers, connectors, and information services. We have built many software components which together constitute an implementation of this system. By the development of our two-model example we have demonstrated some of the advantages of our approach, which brings well-established benefits of modern software engineering techniques to systems biology. Our aim is multiscale

Acknowledgement

We gratefully acknowledge the funding of the United Kingdom Department of Trade and Industry (DTI).

References (42)

B. Bayer et al.
Towards integrated information models for data and documents
Computers and Chemical Engineering
(2004)
J. Belaud et al.
Open software architecture for process simulation: The current status of CAPE-OPEN standard
Computer Aided Chemical Engineering
(2002)
T. Hofer
Model of intercellular calcium oscillations in hepatocytes: Synchronization of heterogeneous cells
Biophysical Journal
(1999)
U. Kummer et al.
Switching from simple to complex oscillations in calcium signaling
Biophysical Journal
(2000)
C. Lloyd et al.
CellML: Its future, present and past
Progress in Biophysics and Molecular Biology
(2004)
L. Loew et al.
The virtual cell: A software environment for computational cell biology
Trends in Biotechnology
(2001)
T.A. Riccobene et al.
Modeling activation and desensitization of G-protein coupled receptors provides insight into ligand efficiency
Journal of Theoretical Biology
(1999)
I. Schomburg et al.
BRENDA: a resource for enzyme data and metabolic information
Trends in Biochemical Sciences
(2002)
G. Schuler et al.
Entrez: Molecular biology database and retrieval system
Methods in Enzymology
(1996)
M. Antoniotti et al.
Model building and model checking for biochemical processes
Cell Biochemistry and Biophysics
(2003)

M. Ashburner et al.
Gene ontology: tool for the unification of biology
Nature Genetics
(2000)
Box, D., Ehnebuske, D., Kakivaya, G., & Layman, A. (2000). Simple object access protocol (SOAP) 1.1. W3C...
K. Burrage
Parallel and sequential methods for ordinary differential equations
(1995)
A. Campbell et al.
Dynamic information architecture system: An advanced simulation framework for military and civilian applications
Society for Computer Simulation International, Simulation Series
(1998)
P. Chen
The entity-relationship model—Toward a unified view of data
ACM Transactions on Database Systems
(1976)
Christensen, E., Curbera, F., & Meredith, G. (2001). Web services description language (WSDL)...
Ermentrout, B. (2000)....
A. Finkelstein et al.
Computational challenges of systems biology
IEEE Computer
(2004)
J. Hetherington et al.
Simplification and its consequences in biological modelling: Conclusions from a study of calcium oscillations in hepatocytes
Journal of the Royal Society: Interface
(2005)
M. Hucka et al.
The ERATO systems biology workbench: Enabling interaction and exchange between software tools for systems biology
Proceedings Pacific Symposium on Biocomputing
(2002)
M. Hucka et al.
The systems biology markup language (SBML): A medium for representation and exchange of biochemical models
Bioinformatics
(2003)
