#PAGE_PARAMS# #ADS_HEAD_SCRIPTS# #MICRODATA#

Estimating Divergence Time and Ancestral Effective Population Size of Bornean and Sumatran Orangutan Subspecies Using a Coalescent Hidden Markov Model


Due to genetic variation in the ancestor of two populations or two species, the divergence time for DNA sequences from two populations is variable along the genome. Within genomic segments all bases will share the same divergence—because they share a most recent common ancestor—when no recombination event has occurred to split them apart. The size of these segments of constant divergence depends on the recombination rate, but also on the speciation time, the effective population size of the ancestral population, as well as demographic effects and selection. Thus, inference of these parameters may be possible if we can decode the divergence times along a genomic alignment. Here, we present a new hidden Markov model that infers the changing divergence (coalescence) times along the genome alignment using a coalescent framework, in order to estimate the speciation time, the recombination rate, and the ancestral effective population size. The model is efficient enough to allow inference on whole-genome data sets. We first investigate the power and consistency of the model with coalescent simulations and then apply it to the whole-genome sequences of the two orangutan sub-species, Bornean (P. p. pygmaeus) and Sumatran (P. p. abelii) orangutans from the Orangutan Genome Project. We estimate the speciation time between the two sub-species to be thousand years ago and the effective population size of the ancestral orangutan species to be , consistent with recent results based on smaller data sets. We also report a negative correlation between chromosome size and ancestral effective population size, which we interpret as a signature of recombination increasing the efficacy of selection.


Vyšlo v časopise: Estimating Divergence Time and Ancestral Effective Population Size of Bornean and Sumatran Orangutan Subspecies Using a Coalescent Hidden Markov Model. PLoS Genet 7(3): e32767. doi:10.1371/journal.pgen.1001319
Kategorie: Research Article
prolekare.web.journal.doi_sk: https://doi.org/10.1371/journal.pgen.1001319

Souhrn

Due to genetic variation in the ancestor of two populations or two species, the divergence time for DNA sequences from two populations is variable along the genome. Within genomic segments all bases will share the same divergence—because they share a most recent common ancestor—when no recombination event has occurred to split them apart. The size of these segments of constant divergence depends on the recombination rate, but also on the speciation time, the effective population size of the ancestral population, as well as demographic effects and selection. Thus, inference of these parameters may be possible if we can decode the divergence times along a genomic alignment. Here, we present a new hidden Markov model that infers the changing divergence (coalescence) times along the genome alignment using a coalescent framework, in order to estimate the speciation time, the recombination rate, and the ancestral effective population size. The model is efficient enough to allow inference on whole-genome data sets. We first investigate the power and consistency of the model with coalescent simulations and then apply it to the whole-genome sequences of the two orangutan sub-species, Bornean (P. p. pygmaeus) and Sumatran (P. p. abelii) orangutans from the Orangutan Genome Project. We estimate the speciation time between the two sub-species to be thousand years ago and the effective population size of the ancestral orangutan species to be , consistent with recent results based on smaller data sets. We also report a negative correlation between chromosome size and ancestral effective population size, which we interpret as a signature of recombination increasing the efficacy of selection.


Zdroje

1. SiepelA

2009 Perspective: Phylogenomics of primates and their ancestral populations. In review

2. PattersonN

RichterDJ

GnerreS

LanderES

ReichD

2006 Genetic evidence for complex speciation of humans and chimpanzees. Nature 441 1103 1108

3. BecquetC

PrzeworskiM

2007 A new approach to estimate parameters of speciation models with application to apes. Genome Res 17 1505 1519

4. RannalaB

YangZ

2003 Bayes estimation of species divergence times and ancestral population sizes using dna sequences from multiple loci. Genetics 164 1645 1656

5. HobolthA

ChristensenOF

MailundT

SchierupMH

2007 Genomic relationships and speciation times of human, chimpanzee, and gorilla inferred from a coalescent hidden markov model. PLoS Genet 3 e7 doi:10.1371/journal.pgen.0030007

6. BurgessR

YangZ

2008 Estimation of hominoid ancestral population sizes under bayesian coalescent models incorporating mutation rate variation and sequencing errors. Mol Biol Evol 25 1979 1994

7. HeinJ

SchierupMH

WiufC

2005 Gene genealogies, variation and evolution: A primer in coalescent theory Oxford university press

8. TakahataN

1986 An attempt to estimate the effective size of the ancestral species common to 2 extant species from which homologous genes are sequenced. Genetical Research 48 187 190

9. TakahataN

1989 Gene genealogy in 3 related populations - consistency probability between gene and population trees. Genetics 122 957 966

10. YangZH

1997 On the estimation of ancestral population sizes of modern humans. Genetical Research 69 111 116

11. WallJD

2003 Estimating ancestral population sizes and divergence times. Genetics 163 395 404

12. DutheilJY

GanapathyG

HobolthA

MailundT

UyenoyamaMK

2009 Ancestral population genomics: The coalescent hidden markov model approach. Genetics 183 259 274

13. HeyJ

NielsenR

2004 Multilocus methods for estimating population sizes, migration rates and divergence time, with applications to the divergence of drosophila pseudoobscura and d. persimilis. Genetics 167 747 760

14. HudsonRR

1983 Properties of a neutral allele model with intragenic recombination. Theor Popul Biol 23 183 201

15. WiufC

HeinJ

1997 On the number of ancestors to a DNA sequence. Genetics 147 1459 1468

16. WiufC

HeinJ

1999 The ancestry of a sample of sequences subject to recombination. Genetics 151 1217 1228

17. LockeD

Unveiling the ancient diversity and slow evolution of the orangutan genome. In progress

18. WiufC

HeinJ

1999 Recombination as a point process along sequences. Theor Popul Biol 55 248 259

19. MarjoramP

WallJ

2006 Fast ‘coalescent’ simulations. BMC Genetics 7 16

20. ChenGK

MarjoramP

WallJD

2009 Fast and exible simulation of DNA sequence data. Genome Res 19 136 42

21. McVickerG

GordonD

DavisC

GreenP

2009 Widespread genomic signatures of natural selection in hominid evolution. PLoS Genet 5 e1000471 doi:10.1371/journal.pgen.1000471

22. FelsensteinJ

1981 Evolutionary trees from dna sequences: a maximum likelihood approach. J Mol Evol 17 368 76

Štítky
Genetika Reprodukčná medicína

Článok vyšiel v časopise

PLOS Genetics


2011 Číslo 3
Najčítanejšie tento týždeň
Najčítanejšie v tomto čísle
Kurzy

Zvýšte si kvalifikáciu online z pohodlia domova

Aktuální možnosti diagnostiky a léčby litiáz
nový kurz
Autori: MUDr. Tomáš Ürge, PhD.

Všetky kurzy
Prihlásenie
Zabudnuté heslo

Zadajte e-mailovú adresu, s ktorou ste vytvárali účet. Budú Vám na ňu zasielané informácie k nastaveniu nového hesla.

Prihlásenie

Nemáte účet?  Registrujte sa

#ADS_BOTTOM_SCRIPTS#