Græmlin: General and robust alignment of multiple large interaction networks

  1. Jason Flannick1,4,
  2. Antal Novak1,4,
  3. Balaji S. Srinivasan2,3,
  4. Harley H. McAdams2, and
  5. Serafim Batzoglou1,5
  1. 1Department of Computer Science, Stanford University, Stanford, California 94305, USA;
  2. 2Department of Developmental Biology, Stanford University, Stanford, California 94305, USA;
  3. 3Department of Electrical Engineering, Stanford University, Stanford, California 94305, USA
    1. 4 These authors contributed equally to this work.

    Abstract

    The recent proliferation of protein interaction networks has motivated research into network alignment: the cross-species comparison of conserved functional modules. Previous studies have laid the foundations for such comparisons and demonstrated their power on a select set of sparse interaction networks. Recently, however, new computational techniques have produced hundreds of predicted interaction networks with interconnection densities that push existing alignment algorithms to their limits. To find conserved functional modules in these new networks, we have developed Græmlin, the first algorithm capable of scalable multiple network alignment. Græmlin's explicit model of functional evolution allows both the generalization of existing alignment scoring schemes and the location of conserved network topologies other than protein complexes and metabolic pathways. To assess Græmlin's performance, we have developed the first quantitative benchmarks for network alignment, which allow comparisons of algorithms in terms of their ability to recapitulate the KEGG database of conserved functional modules. We find that Græmlin achieves substantial scalability gains over previous methods while improving sensitivity.

    Footnotes

    • 5 Corresponding author.

      5 E-mail serafim{at}cs.stanford.edu; fax (650) 725-1449.

    • Supplemental material is available online at www.genome.org. Græmlin is available at http://graemlin.stanford.edu, and source code is available under the GNU Public License.

    • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.5235706. Freely available online through the Genome Research Open Access option.

      • Received February 17, 2006.
      • Accepted May 18, 2006.
    • Freely available online through the Genome Research Open Access option.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server