#PAGE_PARAMS# #ADS_HEAD_SCRIPTS# #MICRODATA#

A New Isolation with Migration Model along Complete Genomes Infers Very Different Divergence Processes among Closely Related Great Ape Species


We present a hidden Markov model (HMM) for inferring gradual isolation between two populations during speciation, modelled as a time interval with restricted gene flow. The HMM describes the history of adjacent nucleotides in two genomic sequences, such that the nucleotides can be separated by recombination, can migrate between populations, or can coalesce at variable time points, all dependent on the parameters of the model, which are the effective population sizes, splitting times, recombination rate, and migration rate. We show by extensive simulations that the HMM can accurately infer all parameters except the recombination rate, which is biased downwards. Inference is robust to variation in the mutation rate and the recombination rate over the sequence and also robust to unknown phase of genomes unless they are very closely related. We provide a test for whether divergence is gradual or instantaneous, and we apply the model to three key divergence processes in great apes: (a) the bonobo and common chimpanzee, (b) the eastern and western gorilla, and (c) the Sumatran and Bornean orang-utan. We find that the bonobo and chimpanzee appear to have undergone a clear split, whereas the divergence processes of the gorilla and orang-utan species occurred over several hundred thousands years with gene flow stopping quite recently. We also apply the model to the Homo/Pan speciation event and find that the most likely scenario involves an extended period of gene flow during speciation.


Vyšlo v časopise: A New Isolation with Migration Model along Complete Genomes Infers Very Different Divergence Processes among Closely Related Great Ape Species. PLoS Genet 8(12): e32767. doi:10.1371/journal.pgen.1003125
Kategorie: Research Article
prolekare.web.journal.doi_sk: https://doi.org/10.1371/journal.pgen.1003125

Souhrn

We present a hidden Markov model (HMM) for inferring gradual isolation between two populations during speciation, modelled as a time interval with restricted gene flow. The HMM describes the history of adjacent nucleotides in two genomic sequences, such that the nucleotides can be separated by recombination, can migrate between populations, or can coalesce at variable time points, all dependent on the parameters of the model, which are the effective population sizes, splitting times, recombination rate, and migration rate. We show by extensive simulations that the HMM can accurately infer all parameters except the recombination rate, which is biased downwards. Inference is robust to variation in the mutation rate and the recombination rate over the sequence and also robust to unknown phase of genomes unless they are very closely related. We provide a test for whether divergence is gradual or instantaneous, and we apply the model to three key divergence processes in great apes: (a) the bonobo and common chimpanzee, (b) the eastern and western gorilla, and (c) the Sumatran and Bornean orang-utan. We find that the bonobo and chimpanzee appear to have undergone a clear split, whereas the divergence processes of the gorilla and orang-utan species occurred over several hundred thousands years with gene flow stopping quite recently. We also apply the model to the Homo/Pan speciation event and find that the most likely scenario involves an extended period of gene flow during speciation.


Zdroje

1. LiH, DurbinR (2011) Inference of human population history from individual whole-genome sequences. Nature 475: 493–496.

2. BurgessR, YangZ (2008) Estimation of hominoid ancestral population sizes under bayesian coalescent models incorporating mutation rate variation and sequencing errors. Mol Biol Evol 25: 1979–1994.

3. WangY, HeyJ (2010) Estimating divergence parameters with small samples from a large number of loci. Genetics 184: 363–379.

4. HeyJ (2010) Isolation with migration models for more than two populations. Mol Biol Evol 27: 905–920.

5. YangZ (2010) A likelihood ratio test of speciation with gene ow using genomic sequence data. Genome Biology and Evolution 2: 200–211.

6. GronauI, HubiszMJ, GulkoB, DankoCG, SiepelA (2011) Bayesian inference of ancient human demography from individual genome sequences. Nature Genetics 43: 1031–1034.

7. ZhuT, YangZ (2012) Maximum Likelihood Implementation of an Isolation-with-Migration Model with Three Species for Testing Speciation with Gene Flow. Mol Biol Evol

8. BecquetC, PrzeworskiM (2007) A new approach to estimate parameters of speciation models with application to apes. Genome Res 17: 1505–1519.

9. WiufC, HeinJ (1999) Recombination as a point process along sequences. Theor Popul Biol 55: 248–59.

10. McVeanGAT, CardinNJ (2005) Approximating the coalescent with recombination. Philosophical transactions of the Royal Society of London Series B, Biological sciences 360: 1387–1393.

11. MarjoramP, WallJD (2006) Fast “coalescent” simulation. BMC genetics 7: 16.

12. ChenGK, MarjoramP, WallJD (2009) Fast and exible simulation of DNA sequence data. Genome Research 19: 136–142.

13. HobolthA, ChristensenOF, MailundT, SchierupMH (2007) Genomic relationships and speciation times of human, chimpanzee, and gorilla inferred from a coalescent hidden markov model. PLoS Genet 3: e7.

14. DutheilJY, GanapathyG, HobolthA, MailundT, UyenoyamaMK, et al. (2009) Ancestral population genomics: The coalescent hidden markov model approach. Genetics 183: 259–274.

15. MailundT, DutheilJY, HobolthA, LunterG, SchierupMH (2011) Estimating divergence time and ancestral effective population size of bornean and sumatran orangutan subspecies using a coalescent hidden markov model. PLoS Genet 7: e1001319.

16. PaulJS, SteinruckenM, SongYS (2011) An accurate sequentially Markov conditional sampling distribution for the coalescent with recombination. Genetics 187: 1115–1128.

17. HobolthA, DutheilJY, HawksJ, SchierupMH, MailundT (2011) Incomplete lineage sorting patterns among human, chimpanzee, and orangutan suggest recent orangutan speciation and widespread selection. Genome Research 21: 349–356.

18. KongA, ThorleifssonG, GudbjartssonDF, MassonG, SigurdssonA, et al. (2010) Fine-scale recombination rate differences between sexes, populations and individuals. Nature 467: 1099–1103.

19. PrüferK, MunchK, HellmannI, AkagiK, MillerJR, et al. (2012) The bonobo genome compared with the chimpanzee and human genomes. Nature 486: 527–531.

20. ScallyA, DutheilJY, HillierLW, JordanGE, GoodheadI, et al. (2012) Insights into hominid evolution from the gorilla genome sequence. Nature 483: 169–175.

21. LockeDP, HillierLW, WarrenWC, WorleyKC, NazarethLV, et al. (2011) Comparative and demographic analysis of orang-utan genomes. Nature 469: 529–533.

22. ScallyA, DurbinR (2012) Revising the human mutation rate: implications for understanding human evolution. Nat Rev Genet 13: 745–753.

23. LangergraberKEK, PrüferKK, RowneyCC, BoeschCC, CrockfordCC, et al. (2012) Generation times in wild chimpanzees and gorillas suggest earlier divergence times in great ape and human evolution. PNAS 109: 15716–15721.

24. KongA, FriggeML, MassonG, BesenbacherS, SulemP, et al. (2012) Rate of de novo mutations and the importance of father's age to disease risk. Nature 488: 471–475.

25. ThalmannO, FischerA, LankesterF, PääboS, VigilantL (2007) The complex evolutionary history of gorillas: insights from genomic data. Mol Biol Evol 24: 146–58.

26. SteiperME (2006) Population history, biogeography, and taxonomy of orangutans (genus: Pongo) based on a population genetic meta-analysis of multiple loci. J Hum Evol 50: 509–22.

27. WonYJ, HeyJ (2005) Divergence population genetics of chimpanzees. Mol Biol Evol 22: 297–307.

28. HeyJ (2010) The divergence of chimpanzee species and subspecies as revealed in multipopulation isolation-with-migration analyses. Mol Biol Evol 27: 921–933.

29. Myers ThompsonJA (2003) A model of the biogeographical journey from Proto-pan to Pan paniscus. Primates; journal of primatology 44: 191–197.

30. StankiewiczJ, de WitMJ (2006) A proposed drainage evolution model for Central Africa–Did the Congo ow east? Journal of African Earth Sciences 44: 75–84.

31. GreenRE, KrauseJ, BriggsAW, MaricicT, StenzelU, et al. (2010) A draft sequence of the Neandertal genome. Science (New York, NY) 328: 710–722.

32. ReichD, GreenRE, KircherM, KrauseJ, PattersonN, et al. (2010) Genetic history of an archaic hominin group from Denisova Cave in Siberia. Nature 468: 1053–1060.

33. HammerMF, WoernerAE, MendezFL, WatkinsJC, WallJD (2011) Genetic evidence for archaic admixture in Africa. Proceedings Of The National Academy Of Sciences Of The United States Of America 108: 15123–15128.

34. ReichD, PattersonN, KircherM, DelfinF, NandineniMR, et al. (2011) Denisova admixture and the first modern human dispersals into southeast Asia and oceania. American journal of human genetics 89: 516–528.

35. SankararamanSS, PattersonNN, LiHH, PääboSS, ReichDD (2012) The Date of Interbreeding between Neandertals and Modern Humans. PLoS Genet 8: e1002947.

36. PattersonN, RichterDJ, GnerreS, LanderES, ReichD (2006) Genetic evidence for complex speciation of humans and chimpanzees. Nature 441: 1103–1108.

37. WakeleyJ (2008) Complex speciation of humans and chimpanzees. Nature 452: E3–4; discussion E4.

38. PresgravesDC, YiSV (2009) Doubts about complex speciation between humans and chimpanzees. Trends in ecology and evolution 24: 533–540.

39. WebsterMT (2009) Patterns of autosomal divergence between the human and chimpanzee genomes support an allopatric model of speciation. Gene 443: 70–75.

40. YamamichiMM, GojoboriJJ, InnanHH (2011) An autosomal analysis gives no genetic evidence for complex speciation of humans and chimpanzees. Mol Biol Evol 29: 145–156.

41. Mailund T, Halager A, Westergaard M (2012) Using Colored Petri Nets to Construct Coalescent Hidden Markov Models: Automatic Translation from Demographic Specifications to Efficient Inference Methods. In: Haddad S, Pomello L, editors, Application and Theory of Petri Nets. Springer Berlin/Heidelberg, pp. 32–50.

42. NachmanMW, PayseurBA (2012) Recombination rate variation and speciation: theoretical predictions and empirical results from rabbits and mice. Philosophical transactions of the Royal Society of London Series B, Biological sciences 367: 409–421.

43. NadeauNJ, WhibleyA, JonesRT, DaveyJW, DasmahapatraKK, et al. (2012) Genomic islands of divergence in hybridizing Heliconius butteries identified by large-scale targeted sequencing. Philosophical transactions of the Royal Society of London Series B, Biological sciences 367: 343–353.

44. JanoušekV, WangL, LuzynskiK, DufkováP, VyskočilováMM, et al. (2012) Genome-wide architecture of reproductive isolation in a naturally occurring hybrid zone between Mus musculus musculus and M. m. domesticus. Molecular Ecology 21: 3032–3047.

45. Hein J, Schierup MH, Wiuf C (2005) Gene genealogies, variation and evolution: A primer in coalescent theory. Oxford university press.

46. SimonsenK, ChurchillG (1997) A Markov Chain Model of Coalescence with Recombination. Theor Popul Biol 52: 43–59.

47. SlatkinM, PollackJL (2006) The concordance of gene trees and species trees at two linked loci. Genetics 172: 1979–1984.

48. HobolthA, AndersenLN, MailundT (2011) On computing the coalescence time density in an isolation-with-migration model with few samples. Genetics 187: 1241–1243.

49. TavareS (1979) Note on finite homogeneous continuous-time Markov-chains. Biometrics 35: 831–834.

50. Sand A, Pedersen C, Mailund T, Brask A (2010) HMMlib: A C++ library for general hidden Markov models exploiting modern CPUs. In: Proceedings of the 2nd International Workshop on High Performance Computational Systems Biology. IEEE, pp. 126–134.

51. MailundT, SchierupMH, PedersenCNS, MechlenborgPJM, MadsenJN, et al. (2005) CoaSim: a exible environment for simulating genetic data under coalescent models. BMC Bioinformatics 6: 252.

52. DutheilJ, BoussauB (2008) Non-homogeneous models of sequence evolution in the Bio++ suite of libraries and programs. BMC Evol Biol 8: 255.

53. LiH, DurbinR (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25: 1754–1760.

54. LunterG, GoodsonM (2011) Stampy: a statistical algorithm for sensitive and fast mapping of Illumina sequence reads. Genome Research 21: 936–939.

55. LiH, HandsakerB, WysokerA, FennellT, RuanJ, et al. (2009) The Sequence Alignment/Map format and SAMtools. Bioinformatics 25: 2078–2079.

56. Harris RS (2007) Improved Pairwise Alignment of Genomic DNA. Dissertation, Pennsylvania State University.

57. KentWJ, BaertschR, HinrichsA, MillerW, HausslerD (2003) Evolution's cauldron: duplication, deletion, and rearrangement in the mouse and human genomes. Proceedings of the National Academy of Sciences of the United States of America 100: 11484–9.

58. DreszerTR, KarolchikD, ZweigAS, HinrichsAS, RaneyBJ, et al. (2012) The UCSC genome browser database: extensions and updates 2011. Nucleic acids research 40: D918–923.

59. BlanchetteM, KentWJ, RiemerC, ElnitskiL, SmitAFA, et al. (2004) Aligning multiple genomic sequences with the threaded blockset aligner. Genome Research 14: 708–715.

Štítky
Genetika Reprodukčná medicína

Článok vyšiel v časopise

PLOS Genetics


2012 Číslo 12
Najčítanejšie tento týždeň
Najčítanejšie v tomto čísle
Kurzy

Zvýšte si kvalifikáciu online z pohodlia domova

Aktuální možnosti diagnostiky a léčby litiáz
nový kurz
Autori: MUDr. Tomáš Ürge, PhD.

Všetky kurzy
Prihlásenie
Zabudnuté heslo

Zadajte e-mailovú adresu, s ktorou ste vytvárali účet. Budú Vám na ňu zasielané informácie k nastaveniu nového hesla.

Prihlásenie

Nemáte účet?  Registrujte sa

#ADS_BOTTOM_SCRIPTS#