Systematic genome-wide annotation of spliceosomal proteins reveals differential gene family expansion

  1. Nuno L. Barbosa-Morais1,2,
  2. Maria Carmo-Fonseca2, and
  3. Samuel Aparício1,3,4
  1. 1 University of Cambridge, Department of Oncology, Hutchison-MRC Research Centre, Cambridge CB2 2XZ, United Kingdom
  2. 2 Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Av. Prof. Egas Moniz, 1649-028 Lisboa, Portugal

Abstract

Although more than 200 human spliceosomal and splicing-associated proteins are known, the evolution of the splicing machinery has not been studied extensively. The recent near-complete sequencing and annotation of distant vertebrate and chordate genomes provides the opportunity for an exhaustive comparative analysis of splicing factors across eukaryotes. We describe here our semiautomated computational pipeline to identify and annotate splicing factors in representative species of eukaryotes. We focused on protein families whose role in splicing is confirmed by experimental evidence. We visually inspected 1894 proteins and manually curated 224 of them. Our analysis shows a general conservation of the core spliceosomal proteins across the eukaryotic lineage, contrasting with selective expansions of protein families known to play a role in the regulation of splicing, most notably of SR proteins in metazoans and of heterogeneous nuclear ribonucleoproteins (hnRNP) in vertebrates. We also observed vertebrate-specific expansion of the CLK and SRPK kinases (which phosphorylate SR proteins), and the CUG-BP/CELF family of splicing regulators. Furthermore, we report several intronless genes amongst splicing proteins in mammals, suggesting that retrotransposition contributed to the complexity of the mammalian splicing apparatus.

Footnotes

  • [Supplemental material is available online at www.genome.org.]

  • Article published online ahead of print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.3936206.

  • 3 Present address: BC Cancer Agency, Vancouver V5Z 1L3, Canada.

  • 4 Corresponding author. E-mail saparicio{at}bccrc.ca; fax 1 604-675-8219.

    • Accepted August 9, 2005.
    • Received March 15, 2005.
| Table of Contents

Preprint Server