#PAGE_PARAMS# #ADS_HEAD_SCRIPTS# #MICRODATA#

A Population Genetics-Phylogenetics Approach to Inferring Natural Selection in Coding Sequences


Through an analysis of polymorphism within and divergence between species, we can hope to learn about the distribution of selective effects of mutations in the genome, changes in the fitness landscape that occur over time, and the location of sites involved in key adaptations that distinguish modern-day species. We introduce a novel method for the analysis of variation in selection pressures within and between species, spatially along the genome and temporally between lineages. We model codon evolution explicitly using a joint population genetics-phylogenetics approach that we developed for the construction of multiallelic models with mutation, selection, and drift. Our approach has the advantage of performing direct inference on coding sequences, inferring ancestral states probabilistically, utilizing allele frequency information, and generalizing to multiple species. We use a Bayesian sliding window model for intragenic variation in selection coefficients that efficiently combines information across sites and captures spatial clustering within the genome. To demonstrate the utility of the method, we infer selective pressures acting in Drosophila melanogaster and D. simulans from polymorphism and divergence data for 100 X-linked coding regions.


Vyšlo v časopise: A Population Genetics-Phylogenetics Approach to Inferring Natural Selection in Coding Sequences. PLoS Genet 7(12): e32767. doi:10.1371/journal.pgen.1002395
Kategorie: Research Article
prolekare.web.journal.doi_sk: https://doi.org/10.1371/journal.pgen.1002395

Souhrn

Through an analysis of polymorphism within and divergence between species, we can hope to learn about the distribution of selective effects of mutations in the genome, changes in the fitness landscape that occur over time, and the location of sites involved in key adaptations that distinguish modern-day species. We introduce a novel method for the analysis of variation in selection pressures within and between species, spatially along the genome and temporally between lineages. We model codon evolution explicitly using a joint population genetics-phylogenetics approach that we developed for the construction of multiallelic models with mutation, selection, and drift. Our approach has the advantage of performing direct inference on coding sequences, inferring ancestral states probabilistically, utilizing allele frequency information, and generalizing to multiple species. We use a Bayesian sliding window model for intragenic variation in selection coefficients that efficiently combines information across sites and captures spatial clustering within the genome. To demonstrate the utility of the method, we infer selective pressures acting in Drosophila melanogaster and D. simulans from polymorphism and divergence data for 100 X-linked coding regions.


Zdroje

1. TinbergenN 1963 On aims and methods of ethology. Zeitschrift für Tierpsychologie 20 410 433

2. GouldSJLewontinRC 1979 The spandrels of San Marco and the Panglossian paradigm. Proc Roy Soc Lond B 205 581 598

3. KimuraM 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge

4. Eyre-WalkerAKeightleyPD 2007 The distribution of fitness effects of new mutations. Nat Rev Genet 8 610 618

5. SellaGPetrovDPrzeworskiMAndolfattoP 2009 Pervasive natural selection in the Drosophila genome? PLoS Genet 5 e1000495 doi:10.1371/journal.pgen.1000495

6. PiganeauGEyre-WalkerA 2003 Estimating the distribution of fitness effects from DNA sequence data: implications for the molecular clock. Proc Natl Acad Sci USA 100 10335 10340

7. YampolskyLYKondrashovFAKondrashovAS 2005 Distribution of the strength of selection against amino acid replacements in human proteins. Hum Mol Genet 14 3191 3201

8. BoykoARWilliamsonSHIndapARDegenhardtJDHernandezRD 2008 Assessing the evolutionary impact of amino acid mutations in the human genome. PLoS Genet 4 e1000083 doi:10.1371/journal.pgen.1000083

9. FayJCWuCI 2000 Hitchhiking under positive Darwinian selection. Genetics 155 1405 1413

10. SmithNGCEyre-WalkerA 2002 Adaptive protein evolution in Drosophila. Nature 415 1022 1024

11. AndolfattoP 2007 Hitchhiking effects of recurrent beneficial amino acid substitutions in Drosophila melanogaster. 17 1755 1762

12. MacphersonJMSellaGDavisJCPetrovDA 2007 Genomewide spatial correspondence between nonsynonymous divergence and neutral polymorphism reveals extensive adaptation in Drosophila. Genetics 177 2083 2099

13. GiladYBustamanteCDLancetDPääboS 2003 Natural selection on the olfactory receptor gene family in humans and chimpanzees. Am J Hum Genet 73 489 501

14. SabetiPCSchaffnerSFFryBLohumuellerJVarillyP 2006 Positive natural selection in the human lineage. Science 312 1614 1620

15. WilliamsonSHHubiszMJClarkAGPayseurBABustamanteCD 2007 Localizing recent adaptive evolution in the human genome. PLoS Genet 6 e90 doi:10.1371/journal.pgen.0030090

16. McDonaldJHKreitmanM 1991 Adaptive protein evolution at the Adh locus in Drosophila. Nature 351 652 654

17. KimuraM 1977 Preponderance of synonymous changes as evidence for the neutral theory of molecular evolution. Nature 267 275 276

18. SawyerSAHartlDL 1992 Population genetics of polymorphism and divergence. Genetics 132 1161 1176

19. CharlesworthB 1994 The effect of background selection against deleterious mutations on weakly selected, linked variants. Genet Res 63 213 227

20. BustamanteCDNielsenRSawyerSAOlsenKAPuruggananMD 2002 The cost of inbreeding in Arabidopsis. Nature 416 531 534

21. BustamanteCDFledel-AlonAWilliamsonSNielsenRHubiszMT 2005 Natural selection on protein-coding genes in the human genome. Nature 437 1153 1157

22. SawyerSAKulathinalRJBustamanteCDHartlDL 2003 Bayesian analysis suggests that most amino acid replacements in Drosophila are driven by positive selection. J Mol Evol 57 S154 S164

23. SawyerSAParschJZhangZHartlDL 2007 Prevalence of positive selection among nearly neutral amino acid replacements in Drosophila. Proc Natl Acad Sci USA 104 6504 6510

24. BainesJFSawyerSAHartlDLParschJ 2008 Effects of X-linkage and sex-biased gene expression on the rate of adaptive protein evolution in Drosophila. Mol Biol Evol 25 1639 1650

25. FayJWycoffGJWuC-I 2001 Positive and negative selection on the human genome. Genetics 158 1227 1234

26. CharlesworthJEyre-WalkerA 2008 The McDonald-Kreitman test and slightly deleterious mutations. Mol Biol Evol 25 1007 1015

27. NielsenRBustamanteCDClarkAGGlanowskiSSacktonTB 2005 A scan for positively selected genes in the genomes of humans and chimpanzees. PLoS Biol 3 e170 doi:10.1371/journal.pbio.0030170

28. DoningerSWKimHSSwainDCorcueraDWilliamsM 2008 A catalog of neutral and deleterious polymorphism in yeast. PLoS Genet 4 e1000183 doi:10.1371/journal.pgen.1000183

29. NielsenRYangZ 1998 Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. Genetics 148 929 936

30. YangZ 2007 PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol 24 1586 1591

31. WilsonDJMcVeanG 2006 Estimating diversifying selection and functional constraint in the presence of recombination. Genetics 172 1411 1425

32. KernADHausslerD 2010 A population genetic hidden Markov model for detecting genomic regions under selection. Mol Biol Evol 27 1673 1685

33. AndolfattoPWongKMBachtrogD 2011 Effective population size and the efficacy of selection on the X chromosomes of two closely related Drosophila species. Genome Biol Evol 3 114 128

34. HasegawaMKishinoHYanoT 1985 Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J Mol Evol 22 160 174

35. NielsenRYangZ 2003 Estimating the distribution of selection coefficients from phylogenetic data with applications to mitochondrial and viral DNA. Mol Biol Evol 20 1231 1239

36. Drosophila 12 Genomes Consortium 2007 Evolution of genes and genomes on the Drosophila phylogeny. Nature 450 203 218

37. HaddrillPRThorntonKRCharlesworthBAndolfattoP 2005 Multilocus patterns on nucleotide variability and the demographic and selection history of Drosophila melanogaster populations. Genome Res 15 790 799

38. KeightleyPDEyre-WalkerA 2007 Joint inference of the distribution of fitness effects of deleterious mutations and population demography based on nucleotide polymorphism frequencies. Genetics 177 2251 2261

39. Eyre-WalkerAKeightleyPD 2009 Estimating the rate of adaptive molecular evolution in the presence of slightly deleterious mutations and population size change. Mol Biol Evol 26 2097 2108

40. HernandezRD 2008 A flexible forward simulator for populations subject to selection and demography. Bioinformatics 24 2786 2787

41. CallahanBNeherRABachtrogDAndolfattoPShraimanBI 2011 Correlated evolution of nearby residues in Drosophilid proteins. PLoS Genet 7 e1001315 doi:10.1371/journal.pgen.1001315

42. BustamanteCDWakeleyJSawyerSHartlDL 2001 Directional selection and the site-frequency spectrum. Genetics 159 1779 1788

43. DonnellyPNordborgMJoyceP 2001 Likelihoods and simulation methods for a class of nonneutral population genetics models. Genetics 159 853 867

44. HernandezRDWilliamsonSHBustamanteCD 2007 Context dependence, ancestral misidentification, and spurious signatures of natural selection. Mol Biol Evol 24 1792 1800

45. YangZNielsenR 2002 Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages. Mol Biol Evol 19 908 917

46. ZhangJNielsenRYangZ 2005 An improved branch-site likelihood method for detecting positive selection at the molecular level. Mol Biol Evol 22 2472 2479

47. RonquistFHuelsenbeckJP 2003 MRBAYES 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19 1572 1574

48. HoSYShapiroBPhillipsMJCooperADrummondAJ 2007 Evidence for time dependency of molecular rate estimates. Syst Biol 56 515 522

49. KryazhimskiySPlotkinJ 2008 The population genetics of dN/dS. PLoS Genet 4 e1000304 doi:10.1371/journal.pgen.1000304

50. LiYFCostelloJCHollowayAKHahnMW 2008 “Reverse ecology” and the power of population genomics. Evolution 62 2984 2994

51. HillWGRobertsonA 1966 The effect of linkage on limits to artificial selection. Genet Res 8 269 294

52. PlotkinJBKudlaG 2011 Synonymous but not the same: the causes and consequences of codon bias. Nat Rev Genet 12 32 42

53. DesaiMMPlotkinJB 2008 The polymorphism frequency spectrum of finitely many sites under selection. Genetics 180 2175 2191

54. OhtaTGillespieJH 1996 Development of neutral and nearly-neutral theories. Theor Popul Biol 49 128 142

55. FelsensteinJ 1973 Maximum likelihood and minimum-steps methods for estimating evolutionary trees from data on discrete characters. Syst Zool 22 240 249

56. WrightS 1949 Adaptation and selection. JepsonGLSimpsonGGMayrE Genetics, Paleontology and Evolution Princeton University Press 365 389

57. WattersonGA 1977 Heterosis or neutrality? Genetics 85 789 814

58. KimuraM 1955 Stochastic processes and distribution of gene frequencies under natural selection. Cold Spring Harbor Symp Quant Biol 20 33 55

Štítky
Genetika Reprodukčná medicína

Článok vyšiel v časopise

PLOS Genetics


2011 Číslo 12
Najčítanejšie tento týždeň
Najčítanejšie v tomto čísle
Kurzy

Zvýšte si kvalifikáciu online z pohodlia domova

Aktuální možnosti diagnostiky a léčby litiáz
nový kurz
Autori: MUDr. Tomáš Ürge, PhD.

Všetky kurzy
Prihlásenie
Zabudnuté heslo

Zadajte e-mailovú adresu, s ktorou ste vytvárali účet. Budú Vám na ňu zasielané informácie k nastaveniu nového hesla.

Prihlásenie

Nemáte účet?  Registrujte sa

#ADS_BOTTOM_SCRIPTS#