Lessons from Dwarf8 on the Strengths and Weaknesses of Structured Association Mapping
The strengths of association mapping lie in its resolution and allelic richness, but spurious associations arising from historical relationships and selection patterns need to be accounted for in statistical analyses. Here we reanalyze one of the first-generation structured association mapping studies, which associated the Dwarf8 (d8) locus with flowering time in maize, using the full range of new mapping populations, statistical approaches, and haplotype maps. Because this trait was highly correlated with population structure, we found that basic structured association methods overestimate phenotypic effects in the region, while mixed model approaches perform substantially better. Combining these results with analysis of the maize nested association mapping population (a multi-family crossing design), we conclude that most, if not all, of the QTL effects at the general location of the d8 locus are from rare extended haplotypes that include other linked QTLs and that d8 is unlikely to be involved in controlling flowering time in maize. Previous independent studies have shown evidence for selection at the d8 locus. Based on the evidence of population bottleneck, selection patterns, and haplotype structure observed in the region, we suggest that multiple traits may be strongly correlated with population structure and that selection on these traits has influenced segregation patterns in the region. Overall, this study provides insight into how modern association and linkage mapping, combined with haplotype analysis, can produce more robust results.
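The confounding described above can be illustrated with a toy simulation. This is a minimal sketch and not the analysis used in the study: the two subpopulations, the allele frequencies, the effect size, and the function name `simulate_and_fit` are all assumptions made for illustration. The marker has no true effect on the trait, yet a naive regression assigns it a large one because both the marker and the trait track subpopulation membership. Here a single group covariate is enough to remove the bias because the simulated structure is only two discrete groups; in real panels with finer relatedness, the abstract reports that such basic structure corrections still overestimate effects and that mixed models do better.

```python
import numpy as np


def simulate_and_fit(n_per_group=1000, structure_effect=3.0, seed=0):
    """Return (naive_effect, adjusted_effect) for a marker with NO true effect."""
    rng = np.random.default_rng(seed)

    # Two hypothetical subpopulations (coded 0 and 1) of equal size.
    group = np.repeat([0.0, 1.0], n_per_group)

    # Assumed allele frequencies: rare in group 0, common in group 1.
    freq = np.where(group == 1.0, 0.8, 0.2)
    marker = (rng.random(group.size) < freq).astype(float)

    # The trait depends only on group membership plus noise;
    # the marker itself contributes nothing.
    trait = structure_effect * group + rng.normal(0.0, 1.0, group.size)

    ones = np.ones_like(group)

    # Naive model: trait ~ intercept + marker
    x_naive = np.column_stack([ones, marker])
    naive = np.linalg.lstsq(x_naive, trait, rcond=None)[0][1]

    # Structure-adjusted model: trait ~ intercept + marker + group
    x_adjusted = np.column_stack([ones, marker, group])
    adjusted = np.linalg.lstsq(x_adjusted, trait, rcond=None)[0][1]

    return naive, adjusted


naive_effect, adjusted_effect = simulate_and_fit()
print(f"naive marker effect:    {naive_effect:.2f}")
print(f"adjusted marker effect: {adjusted_effect:.2f}")
```

With these assumed settings the naive estimate comes out far from zero, while the adjusted estimate stays close to the true value of zero.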
Published in:
Lessons from Dwarf8 on the Strengths and Weaknesses of Structured Association Mapping. PLoS Genet 9(2): e1003246. doi:10.1371/journal.pgen.1003246
Category:
Research Article
DOI:
https://doi.org/10.1371/journal.pgen.1003246
Article published in the journal PLOS Genetics, 2013, Issue 2