Regular Article
Prototype Implementation of the Integrated Genomic Database

https://doi.org/10.1006/cbmr.1994.1011Get rights and content

Abstract

We aim to develop an open software system to handle human genome data. The system, called Integrated Genomic Database (IGD), will integrate information from many genomic databases and experimental resources into a comprehensive target-end database (IGD TED). Users will access front-end client systems (IGD FRED) to download data of interest to their computers and merge them with their own local data. FREDs will provide persistent storage of, and instant access to, retrieved data; a friendly graphical interface; tools for querying, browsing, analyzing, and editing local data; interface to external analysis; and tools for communicating with the outside world. The TED will be accessible over the network (online and offline) as a read-only resource for multiple clients. It collects data from major databases for nucleotide and protein sequences and structures, genome maps, experimental reagents, phenotypes, and bibliographic data, and sets of raw data produced at genome centers and laboratories. Beside character-based access via Gopher, WAIS, FTP, and several query language interfaces to the TED, we will develop a specialized front-end client, IGD FRED, with its own database manager, based on the ACEDB program. The FRED will support graphical display methods for sequence feature maps, chromosomal genetic and physical maps, and experimental objects like clone grids, etc. FRED will also provide an interface to important analysis software packages and tools for submitting data to external databases in their own format.

References (0)

Cited by (36)

  • Information technology solutions for integration of biomolecular and clinical data in the identification of new cancer biomarkers and targets for therapy

    2010, Pharmacology and Therapeutics
    Citation Excerpt :

    The central process is data warehousing which includes three steps: 1) to develop a unified model that can accommodate all the information from single databases, 2) transformation of the data into this model and loading to the data warehouse, and 3) retrieval of data from the source databases in one environment (‘one-stop shop’) as well as integrated access to the data requiring knowledge, that the individual sources cannot provide (Stein, 2003). This concept has been applied for a long time to many different (business) applications and also found its way into biology (Ritter et al., 1994; Schonbach et al., 2000; Hu et al., 2004; Kasprzyk et al., 2004; Shah et al., 2005; Lee et al., 2006; Rhodes et al., 2007). For an in-depth overview of data warehouse technology we refer the reader to Chaudhuri and Dayal (1997), Devlin (1997), and Inmon (2002).

  • Bioinformatics integration and agent technology

    2004, Journal of Biomedical Informatics
    Citation Excerpt :

    Note that with Icarus one can only represent the entries of the data sources; there is no conceptual model (schema) and like BioKleisli the integration and location details are not transparent to the user. The Integrated Genomic Database (IGD) [27] uses the Concise Object Query Language (COQL) and an Extended Entity Relation (EER) model for the CQL and CDM, respectively. One of the few systems that uses materialised federation, IGD, integrates more than a dozen data sources quite successfully, although its extensibility has been questioned [13].

  • Heterogeneous database integration in biomedicine

    2001, Journal of Biomedical Informatics
  • New approaches in data integration for systems chemical biology

    2013, Current Topics in Medicinal Chemistry
View all citing articles on Scopus
View full text