Trends in Genetics
Volume 22, Issue 1, January 2006, Pages 1-5
Journal home page for Trends in Genetics

Genome Analysis
Rapid evolution of noncoding RNAs: lack of conservation does not mean lack of function

https://doi.org/10.1016/j.tig.2005.10.003Get rights and content

The mammalian transcriptome contains many non-protein-coding RNAs (ncRNAs), but most of these are of unclear significance and lack strong sequence conservation, prompting suggestions that they might be non-functional. However, certain long functional ncRNAs such as Air and Xist are also poorly conserved. In this article, we systematically analyzed the conservation of several groups of functional ncRNAs, including miRNAs, snoRNAs and longer ncRNAs whose function has been either documented or confidently predicted. As expected, miRNAs and snoRNAs were highly conserved. By contrast, the longer functional non-micro, non-sno ncRNAs were much less conserved with many displaying rapid sequence evolution. Our findings suggest that longer ncRNAs are under the influence of different evolutionary constraints and that the lack of conservation displayed by the thousands of candidate ncRNAs does not necessarily signify an absence of function.

Introduction

Recent transcriptome analyses have shown that most of the metazoan genome is transcribed 1, 2. Using cDNA cloning 3, 4, 5, 6, 7, 8, 9 and genome tiling arrays 10, 11, 12, 13, 14, 15, 16, researchers have detected surprisingly large numbers of candidate non protein-coding RNAs (ncRNAs), many of which appear to be localized to the nucleus and/or lack polyadenylation [15]. The function – if any – of these transcripts remains a matter of debate 17, 18, the resolution of which will have important implications for our understanding of genome biology.

Wang and colleagues recently reported that non-protein-coding transcripts identified in mouse cDNA collections are poorly conserved and therefore argued that they are likely to be non-functional [19]. By contrast, many members of known functional classes of ncRNAs – including microRNAs (miRNAs) and small nucleolar RNAs (snoRNAs) – are well-conserved across a diverse range of species 4, 5, 6, 7. Candidate ncRNAs however average ∼2 kb in length 9, 20 and hence are different from miRNAs and snoRNAs (which are typically 21–25 nt and 60–300 nt, respectively), and it might be more meaningful to compare their evolution with longer ncRNAs whose functions have been validated. Well-studied large ncRNAs such as Xist and Air are poorly conserved, although there are small segments of their sequences that are highly conserved 21, 22, 23, 24, 25. Therefore, the possibility arises that different types of ncRNAs can exhibit different patterns of sequence conservation. Here we provide evidence that supports this hypothesis and discuss the possible interpretations and implications of our results.

Section snippets

Analysis of functional ncRNA conservation

There are many types of functional ncRNAs: transfer RNAs (tRNAs), ribosomal RNAs (rRNAs) and small nuclear (spliceosomal) RNAs have been known for many years and have core functions in RNA processing and translation. However, in recent years, it has become evident that there are increasing numbers of other ncRNAs that regulate gene expression in different ways in complex organisms, including miRNAs, snoRNAs and other miscellaneous ncRNAs (see Ref. [26] and references therein). These regulatory

Known miRNAs and snoRNAs are well-conserved but other ncRNAs are not

Most of the known miRNAs are highly conserved, with >90% sequence identity between human and mouse (Figure 1a). This level of purifying selection is even greater than that observed for mRNAs or even the protein-coding sequences within mRNAs (Figure 1c,i), which in turn are more highly conserved than introns (Figure 1f). Strong miRNA conservation is hardly surprising given that phylogenetic conservation has been a criterion used to identify miRNAs [31].

Most snoRNAs in mammals have been

Interpretations and implications

We have provided evidence that different classes of functional ncRNAs are evolving differently, compared with one another and with protein-coding sequences. Why should this be, and what is its significance?

The known miRNAs and snoRNAs are well conserved, and there are several probable reasons for this conservation. First, both function by hybridization to other nucleic acids. For miRNAs, which are short, a consequence is that even slight changes in sequence can fundamentally alter function.

Concluding remarks

Our findings have implications for the criteria used to assess the significance of candidate ncRNAs. As noted earlier, others have taken the lack of primary sequence conservation to mean that thousands of putative mouse ncRNAs are non-functional [19]. The evidence presented here indicates that this conclusion might be premature, and that alternative criteria to assess the function of longer ncRNA candidates are needed. It has been suggested that folding energy predictions can be used to

Update

Recently, Willingham et al. [46] used sequence homology to identify 500 human orthologs of mouse ncRNAs identified by RIKEN. They designed siRNAs for functional screens and using cell-based reporter assays, identified a new ncRNA repressor of the nuclear transcription factor of activated T cells (NFAT), in addition to others that appear to be either essential for cell viability or involved in hedgehog signaling 46, 47. The identification of patches of conserved sequences between ncRNAs can be

Acknowledgements

This work was supported by research grants from the Australian Research Council to J.S.M. M.C.F. is a University of Queensland Postdoctoral Fellow. K.C.P. is supported by an NHMRC Medical Postgraduate Research Scholarship. We thank Constance Cepko and Tracy Young for providing us with information on TUG1 ncRNAs; Melanie Ginger and Jeff Rosen for providing a pre-print of their paper on PINC/GB7 ncRNA; and Albin Sandelin for assistance with Figure 1. We also thank the editor and two anonymous

References (47)

  • M. Lagos-Quintana

    Identification of novel genes coding for small expressed RNAs

    Science

    (2001)
  • R.C. Lee et al.

    An extensive class of small RNAs in Caenorhabditis elegans

    Science

    (2001)
  • Y. Okazaki

    Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs

    Nature

    (2002)
  • T. Ota

    Complete sequencing and characterization of 21,243 full-length human cDNAs

    Nat. Genet.

    (2004)
  • P. Kapranov

    Large-scale transcriptional activity in chromosomes 21 and 22

    Science

    (2002)
  • D. Kampa

    Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22

    Genome Res.

    (2004)
  • V. Stolc

    A gene expression map for the euchromatic genome of Drosophila melanogaster

    Science

    (2004)
  • P. Bertone

    Global identification of human transcribed sequences with genome tiling arrays

    Science

    (2004)
  • J. Cheng

    Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution

    Science

    (2005)
  • P. Kapranov

    Examples of the complex architecture of the human transcriptome revealed by RACE and high density tiling arrays

    Genome Res.

    (2005)
  • J. Wang

    Mouse transcriptome: Neutral evolution of ‘non-coding’ complementary DNAs

    Nature

    (2004)
  • K. Numata

    Identification of putative noncoding RNAs among the RIKEN mouse full-length cDNA collection

    Genome Res.

    (2003)
  • T.B. Nesterova

    Characterization of the genomic Xist locus in rodents reveals conservation of overall gene structure and tandem repeats but rapid evolution of unique sequence

    Genome Res.

    (2001)
  • Cited by (531)

    • G-quadruplex from precursor miR-1587 modulated its maturation and function

      2023, International Journal of Biological Macromolecules
    • A review on the role of LINC00173 in human cancers

      2023, Pathology Research and Practice
    • RNA out of the mist

      2023, Trends in Genetics
    View all citing articles on Scopus
    *

    These authors contributed equally to this work.

    View full text