Positive Selection and Multiple Losses of the LINE-1-Derived L1TD1 Gene in Mammals Suggest a Dual Role in Genome Defense and Pluripotency
Transposable elements comprise major portions of most animal genomes and are selfish genetic elements that may encode proteins needed for their own spread to new genomic locations. Though often considered genomic parasites, these elements also occasionally create novel genes that prove beneficial to the host, a process called 'domestication'. Here, we describe the evolution of a gene, L1TD1, which is derived from the protein-coding regions of the L1 mobile element family. We show that L1TD1 was born in the common ancestor of placental mammals. L1TD1 expression in stem cells and its requirement to maintain the pluripotent state of human embryonic stem cells suggested that it might have been originally domesticated for such a pluripotency role. We find that L1TD1's evolution does not fit the predictions of this model; in fact, L1TD1 has evolved rapidly in primates and mice and has been lost several times in mammals. We suggest an alternative model in which L1TD1 was born as a means to defend genomes against transposable elements, perhaps L1 itself. We propose that following this initial domestication, L1TD1 later became incorporated into pluripotency programs in some mammalian lineages.
Published in:
Positive Selection and Multiple Losses of the LINE-1-Derived L1TD1 Gene in Mammals Suggest a Dual Role in Genome Defense and Pluripotency. PLoS Genet 10(9): e1004531. doi:10.1371/journal.pgen.1004531
Category:
Research Article
DOI:
https://doi.org/10.1371/journal.pgen.1004531
Tags
Genetics, Reproductive medicine
Article published in
PLOS Genetics
2014, Issue 9