Genome sequence of Haloarcula marismortui: A halophilic archaeon from the Dead Sea

  1. Nitin S. Baliga1,7,
  2. Richard Bonneau1,
  3. Marc T. Facciotti1,
  4. Min Pan1,
  5. Gustavo Glusman1,
  6. Eric W. Deutsch1,
  7. Paul Shannon1,
  8. Yulun Chiu2,
  9. Rueyhung Sting Weng3,
  10. Rueichi Richie Gan2,
  11. Pingliang Hung3,
  12. Shailesh V. Date4,6,
  13. Edward Marcotte4,
  14. Leroy Hood1, and
  15. Wailap Victor Ng3,5,7
  1. 1 Institute for Systems Biology, Seattle, Washington 98103, USA
  2. 2 Institute of Biochemistry, National Yang Ming University, Taipei 112, Taiwan
  3. 3 Institute of Biotechnology in Medicine, National Yang Ming University, Taipei 112, Taiwan
  4. 4 Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas 78712, USA
  5. 5 Institute of Bioinformatics and Department of Biotechnology and Laboratory Science in Medicine, National Yang Ming University, Taipei 112, Taiwan
  6. 6 Center for Bioinformatics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA

Abstract

We report the complete sequence of the 4,274,642-bp genome of Haloarcula marismortui, a halophilic archaeal isolate from the Dead Sea. The genome is organized into nine circular replicons of varying G+C compositions ranging from 54% to 62%. Comparison of the genome architectures of Halobacterium sp. NRC-1 and H. marismortui suggests a common ancestor for the two organisms and a genome of significantly reduced size in the former. Both of these halophilic archaea use the same strategy of high surface negative charge of folded proteins as means to circumvent the salting-out phenomenon in a hypersaline cytoplasm. A multitiered annotation approach, including primary sequence similarities, protein family signatures, structure prediction, and a protein function association network, has assigned putative functions for at least 58% of the 4242 predicted proteins, a far larger number than is usually achieved in most newly sequenced microorganisms. Among these assigned functions were genes encoding six opsins, 19 MCP and/or HAMP domain signal transducers, and an unusually large number of environmental response regulators—nearly five times as many as those encoded in Halobacterium sp. NRC-1—suggesting H. marismortui is significantly more physiologically capable of exploiting diverse environments. In comparing the physiologies of the two halophilic archaea, in addition to the expected extensive similarity, we discovered several differences in their metabolic strategies and physiological responses such as distinct pathways for arginine breakdown in each halophile. Finally, as expected from the larger genome, H. marismortui encodes many more functions and seems to have fewer nutritional requirements for survival than does Halobacterium sp. NRC-1.

Footnotes

  • [Supplemental material is available online at www.genome.org and http://halo.systemsbiology.net. The sequence data from this study have been submitted to GenBank under accession nos. AY59290–AY59298].

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.2700304.

  • 7 Corresponding authors. E-mail nbaliga{at}systemsbiology.org; fax (206) 132-1299. E-mail wvng{at}ym.edu.tw; fax 886-2-2826-4092.

    • Accepted August 12, 2004.
    • Received April 21, 2004.
| Table of Contents

Preprint Server