Elsevier

Virus Research

Volume 134, Issues 1–2, June 2008, Pages 221-234
Virus Research

The diversity of retrotransposons and the properties of their reverse transcriptases

https://doi.org/10.1016/j.virusres.2007.12.010Get rights and content

Abstract

A number of abundant mobile genetic elements called retrotransposons reverse transcribe RNA to generate DNA for insertion into eukaryotic genomes. Four major classes of retrotransposons are described here. First, the long-terminal-repeat (LTR) retrotransposons have similar structures and mechanisms to those of the vertebrate retroviruses. Genes that may enable these retrotransposons to leave a cell have been acquired by these elements in a number of animal and plant lineages. Second, the tyrosine recombinase retrotransposons are similar to the LTR retrotransposons except that they have substituted a recombinase for the integrase and recombine into the host chromosomes. Third, the non-LTR retrotransposons use a cleaved chromosomal target site generated by an encoded endonuclease to prime reverse transcription. Finally, the Penelope-like retrotransposons are not well understood but appear to also use cleaved DNA or the ends of chromosomes as primer for reverse transcription. Described in the second part of this review are the enzymatic properties of the reverse transcriptases (RTs) encoded by retrotransposons. The RTs of the LTR retrotransposons are highly divergent in sequence but have similar enzymatic activities to those of retroviruses. The RTs of the non-LTR retrotransposons have several unique properties reflecting their adaptation to a different mechanism of retrotransposition.

Introduction

Vertebrate retroviruses represent but one lineage of an ever-growing family of mobile genetic elements that utilize reverse transcriptase to generate a DNA copy from their RNA transcript. While a few of these other lineages, such as hepadnaviruses and caulimoviruses, are true viruses the largest number of lineages are classified as retrotransposable elements, or retrotransposons. The first retrotransposons to be identified were discovered because they caused mutations in two favorite model organisms: yeast, Saccharomyces cerevisiae, and the fruitfly, Drosophila melanogaster. The sequences of these elements revealed long-terminal repeats (LTRs) and open reading frames that encoded reverse transcriptase, RNase H, integrase, proteinase and gag-like proteins in an organization that was suggestive of retroviruses (Clare and Farabaugh, 1985, Mount and Rubin, 1985). Elegant experiments demonstrated that the yeast element made new copies by reverse transcription of their RNA transcripts (Boeke et al., 1985). These retrotransposons, however, did not encode a protein similar to retroviral envelope (env) genes and did not spread between individuals in a population. The retrotransposons were viewed as possible progenitors of the retroviruses, or alternatively as descendants of the retroviruses by loss of their envelope gene. Today the number of characterized retrotransposons has expanded dramatically and many new examples of elements with different putative env genes have been found. The wide diversity of retrotransposons compared to the limited diversity of vertebrate retroviruses suggests the ancestral forms were retrotransposons.

Without an env-like gene retrotransposons are unable to leave the environment of one cell for another cell, thus they must insert into the chromosomes of the germ cells to insure passage to the next generation. The inability to leave an organism also means that retrotransposons must be more circumspect than a virus in how often they replicate due to the potential damage caused by their insertion into the host genome. Any insertion that significantly reduces the fitness of the host will be lost from the population. Given this constraint, it is remarkable that large numbers of retrotransposon families using a variety of mechanisms to reverse transcribe and insert their genetic information into a genome have become highly successful in every lineage of eukaryotic organisms.

Even more remarkable than their diversity is the abundance of retrotransposons in most organisms. Indeed, the reason why many eukaryotic genomes are so enormous in size is because of the accumulation of retrotransposable elements. For example, retrotransposons constitute 42% of the human genome (Lander et al., 2001) and 75% of the maize genome (SanMiguel et al., 1998). Even these percentages are underestimates because the scrambling of DNA sequences by mutation, recombination and continued retrotransposon insertions make the oldest insertions impossible to recognize. Only those organisms that need to replicate their DNA quickly, or have found recombinational mechanisms to remove insertions, appear to be able to prevent the accumulation of elements over time (Charlesworth et al., 1994).

In the following sections we describe the major classes of retrotransposons that are known today emphasizing their structure, their phylogenetic relationship to each other and their mechanism of retrotransposition. Finally, for those retrotransposons where the reverse transcriptase have been studied, we compare the properties of their reverse transcriptases (RTs) with that of retroviral RTs.

Section snippets

The use of RT sequences to evaluate the relationship of retrotransposons

Determining the relationships between the different classes of retrotransposons has been challenging. Grouping elements by their common structural features and mechanism of insertion works well for those groups that have uniform structures and well-defined mechanisms of integration. However, as will be described below there are few shared features for some groups of retrotransposons and our knowledge of their mechanism of integration is limited. A second approach classifies elements based on

LTR retrotransposons

Based on the phylogeny of their RT domains (Fig. 1) the LTR retrotransposons can be divided into major lineages that are historically referred to as the Ty1/copia group, the Bel group and the Ty3/gypsy group. Ty1 and Ty3 are well-characterized elements from S. cerevisiae, while Copia, Bel and Gypsy are elements from D. melanogaster. These lineages have recently been classified by the International Committee on the Taxonomy of Viruses into two major groups: the Pseudoviridae with three genera,

Properties of the reverse transcriptases from LTR retrotransposons

Most studies of the RTs from LTR retrotransposons have involved direct comparisons of their activities to that of retroviral RTs. To date all studies have been conducted with elements from S. cerevisiae and S. pombe. Even though these elements encode proteins that are highly divergent in sequence from the retroviral enzymes, they exhibit remarkably similar properties.

Bombyx mori R2

Biochemical studies of R2 RT have all been conducted with the entire ORF of the element from Bombyx mori expressed and purified from E. coli (Luan et al., 1993). The purified protein is 120-kDa in size and was found to have the RNA and DNA binding properties and enzymatic activities that gave rise to the retrotransposition mechanism shown in Fig. 2. During these studies of R2 retrotransposition, R2 RT was shown to have a number of unusual properties that differentiate it from the RTs encoded by

Concluding remarks

It is likely that we have only scratched the surface in documenting the variety and the distribution of retrotransposons within eukaryotic genomes. New elements belonging to each of the four groups of retrotransposons as well as completely new groups of elements will no doubt be discovered in the massive amounts of repetitive DNA present in most genomes. These elements have played a major role in determining the size and composition of eukaryotic genomes, and are responsible for much of their

Acknowledgements

We are very appreciative of the insightful comments made by Danna Eickbush on various drafts of this manuscript, as well as the suggestions of several reviewers. Our work on R2 specifically and of retrotransposon evolution in general has been with support from the National Institutes of Health (GM42790) and from the National Science Foundation (MCB0544071).

References (124)

  • M. Kajikawa et al.

    LINEs mobilize SINEs in the eel through a shared 3′ sequence

    Cell

    (2002)
  • N. Kirshenboim et al.

    Expression and characterization of a novel reverse transcriptase of the LTR retrotransposon Tf1

    Virology

    (2007)
  • A. Kurzynska-Kokorniak et al.

    DNA-directed DNA polymerase and strand displacement activity of the reverse transcriptase encoded by the R2 retrotransposon

    J. Mol. Biol.

    (2007)
  • J. Kuzio et al.

    Sequence and analysis of the genome of a baculovirus pathogenic for Lymantria dispar

    Virology

    (1999)
  • M. Lapadat-Tapolsky et al.

    Possible roles of HIV-1 nucleocapsid protein in the specificity of proviral DNA synthesis and in its variability

    J. Mol. Biol.

    (1997)
  • G.J. Latham et al.

    Vertical-scanning mutagenesis of a critical tryptophan in the “minor groove binding track” of HIV-1 reverse transcriptase. Major groove DNA adducts identify specific protein interactions in the minor groove

    J. Biol. Chem.

    (2000)
  • H.L. Levin

    It's prime time for reverse transcriptase

    Cell

    (1997)
  • H.A. Lorenzi et al.

    The VIPER elements of trypanosomes constitute a novel group of tyrosine recombinase-enconding retrotransposons

    Mol. Biochem. Parasitol.

    (2006)
  • D.D. Luan et al.

    Reverse transcription of R2Bm is primed by a nick at the chromosomal target site: a mechanism for non-LTR retrotransposition

    Cell

    (1993)
  • F. Martin et al.

    Characterization of a non-long terminal repeat retrotransposon cDNA (L1Tc) from Trypanosoma cruzi: homology of the first ORF with the APE family of DNA repair enzymes

    J. Mol. Biol.

    (1995)
  • J.V. Moran et al.

    High frequency retrotransposition in cultured mammalian cells

    Cell

    (1996)
  • T.M. Nakamura et al.

    Reversing time: origin of telomerase

    Cell

    (1998)
  • M. Olivares et al.

    The open reading frame 1 of the L1Tc retrotransposon of Trypanosoma cruzi codes for a protein with apurinic–apyrimidinic nuclease activity

    J. Biol. Chem.

    (1997)
  • M. Olivares et al.

    The L1Tc, long interspersed nucleotide element from Trypanosoma cruzi, encodes a protein with 3′-phosphatase and 3′-phosphodiesterase enzymatic activities

    J. Biol. Chem.

    (1999)
  • M. Olivares et al.

    The non-LTR (long terminal repeat) retrotransposon L1Tc from Trypanosoma cruzi codes for a protein with RNase H activity

    J. Biol. Chem.

    (2002)
  • T. Anzai et al.

    Sequence specific recognition and cleavage of telomeric repeats by the endonuclease of non-long terminal repeat retrotransposon TRAS1

    Mol. Cell. Biol.

    (2001)
  • T. Anzai et al.

    Functional roles of 3′-terminal structures of template RNA during in vivo retrotransposition on non-LTR retrotransposon, R1Bm

    Nucleic Acids Res.

    (2005)
  • I.R. Arkhipova et al.

    Retroelements containing introns in diverse invertebrate taxa

    Nat. Genet.

    (2003)
  • A. Atwood-Moore et al.

    Specific recognition and cleavage of the plus-strand primer by reverse transcriptase

    J. Virol.

    (2005)
  • A. Atwood-Moore et al.

    The self primer of the long terminal repeat retrotransposon Tf1 is not removed during reverse transcription

    J. Virol.

    (2006)
  • A. Bateman et al.

    Pfam 3.1: 1313 multiple alignments match the majority of proteins

    Nucleic Acids Res.

    (1999)
  • A. Bibillo et al.

    Functional roles of carboxylate residues comprising the DNA polymerase active site triad of Ty3 reverse transcriptase

    Nucleic Acids Res.

    (2005)
  • J.D. Boeke

    Transposable elements in Saccharomyces cerevisiae

  • J.D. Boeke et al.

    Retrotransposons, endogenous retroviruses, and the evolution of retroelements

  • Boeke, J.D., Eickbush, T.H., Sandmeyer, S.B., Voytas, D.F., 2005. Pseudoviridae. In: Fauquet, C.M., Mayo, M.A.,...
  • M. Boutabout et al.

    DNA synthesis fidelity by the reverse transcriptase of the yeast retrotransposon Ty1

    Nucleic Acids Res.

    (2001)
  • N.J. Bowen et al.

    Genomic analysis of Caenorhabditis elegans reveals ancient families of retroviral-like elements

    Genome Res.

    (1999)
  • W.J. Britt et al.

    Human cytomegalovirus glycoproteins

    Intervirology

    (1996)
  • W.D. Burke et al.

    Are retrotransposons long-term hitchhikers

    Nature

    (1998)
  • P. Capy et al.

    Dynamics and Evolution of Transposable Elements

    (1998)
  • M.C. Chaboissier et al.

    Retrotransposition of the I factor, a non-long terminal repeat retrotransposon of Drosophila, generates tandem repeats at the 3′ end

    Nucleic Acids Res.

    (2000)
  • B. Charlesworth et al.

    The evolutionary dynamics of repetitive DNA in eukaryotes

    Nature

    (1994)
  • S. Christensen et al.

    Target specificity of the endonuclease from Xenopus laevis non-long terminal repeat retrotransposons, Tx1L

    Mol. Cell. Biol.

    (2000)
  • S.M. Christensen et al.

    Role of the Bombyx mori R2 element N-terminal domain in the target-primed reverse transcription (TPRT) reaction

    Nucleic Acids Res.

    (2005)
  • S.M. Christensen et al.

    RNA from the 5’ end of the R2 retrotransposon controls R2 protein binding to and cleavage of its DNA target site

    Proc. Natl. Acad. Sci. U.S.A.

    (2006)
  • J. Clare et al.

    Nucleotide sequence of a yeast Ty element: evidence for an unusal mechanism of gene expression

    Proc. Natl. Acad. Sci. U.S.A.

    (1985)
  • G.J. Cost et al.

    Targeting of human retrotransposons integration is directed by the specificity of the L1 endonuclease for regions of unusual DNA structure

    Biochemistry

    (1998)
  • G.J. Cost et al.

    Human L1 element target-primed reverse transcription in vitro

    EMBO J.

    (2002)
  • M. Dewannieux et al.

    LINE-mediated retrotransposition of marked Alu sequences

    Nat. Genet.

    (2003)
  • R.F. Doolittle et al.

    Origins and evolutionary relationships of retroviruses

    Quart. Rev. Biol.

    (1989)
  • Cited by (201)

    • The first draft genome assembly and data analysis of the Malaysian mahseer (Tor tambroides)

      2023, Aquaculture and Fisheries
      Citation Excerpt :

      The third classification is non-LTR retrotransposons with the absence of inverted or tandem terminal repeats. These have poly(A) tails at 3′ end and variable deletions at 5′ ends which encode open reading frame (ORF), which are prone to mutation (Eickbush & Jamburuthugoda, 2008). For instance, these elements include LINEs (long interspersed nucleotide elements) and SINEs (short interspersed nucleotide elements).

    • Reverse transcriptase

      2023, Metalloenzymes: From Bench to Bedside
    View all citing articles on Scopus
    View full text