Identification of Novel Human Genes Evolutionarily Conserved in Caenorhabditis elegans by Comparative Proteomics

  1. Chun-Hung Lai1,
  2. Chang-Yuan Chou1,
  3. Lan-Yang Ch'ang1,
  4. Chung-Shyan Liu2, and
  5. Wen-chang Lin1,3
  1. 1Institute of Biomedical Sciences, Academia Sinica, Taipei 115, Taiwan, Republic of China; 2Department of Information Engineering, Chung Yuan C. University, Chung Li, Taiwan, Republic of China

Abstract

Modern biomedical research benefits greatly from large-scale genome-sequencing projects, which range from studies of viruses, bacteria, and yeast to multicellular organisms such as Caenorhabditis elegans. Comparative genomic studies offer a vast array of prospects for the identification and functional annotation of human orthologous genes. We present a novel comparative proteomic approach for assembling human gene contigs and assisting gene discovery. The C. elegans proteome was used as an alignment template to assist in novel human gene identification from human EST nucleotide databases. Among the 18,452 available C. elegans protein sequences, our results indicate that at least 83% (15,344 sequences) of the C. elegans proteome has human homologous genes, with 7,954 C. elegans protein records matching known human gene transcripts. Only 11% or less of the C. elegans proteome consists of nematode-specific genes. We found that the remaining 7,390 sequences might lead to discoveries of novel human genes, and over 150 putative full-length human gene transcripts were assembled upon further database analyses.
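The counts quoted in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration and is not part of the authors' pipeline; the variable names are hypothetical, and the three input numbers are taken directly from the abstract.

```python
# Counts as reported in the abstract (variable names are illustrative).
total_proteins = 18_452              # available C. elegans protein sequences
with_human_homologs = 15_344         # sequences with human homologous genes
matching_known_transcripts = 7_954   # sequences matching known human transcripts

# Sequences with a human homolog but no known human transcript:
# the candidates that might lead to novel human genes.
novel_candidates = with_human_homologs - matching_known_transcripts

# Share of the proteome with human homologs, as a percentage.
homolog_percent = 100 * with_human_homologs / total_proteins

print(f"novel candidates: {novel_candidates}")
print(f"proteome with human homologs: {homolog_percent:.1f}%")
```

The difference reproduces the 7,390 remaining sequences, and the ratio is consistent with the "at least 83%" figure.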

[The sequence data described in this paper have been submitted to the GenBank data library under accession nos. AF132936–AF132973, AF151799–AF151909, and AF152097.]

Footnotes

  • 3 Corresponding author.

  • E-MAIL wenlin{at}ibms.sinica.edu.tw; FAX 886–2–2782–9142.

    • Received November 29, 1999.
    • Accepted March 9, 2000.