Statistical Power to Detect Genetic (Co)Variance of Complex Traits Using SNP Data in Unrelated Samples
Genome-wide association studies (GWAS) have identified thousands of genetic variants for hundreds of traits and diseases. However, the genetic variants discovered from GWAS only explained a small fraction of the heritability, resulting in the question of “missing heritability”. We have recently developed approaches (called GREML) to estimate the overall contribution of all SNPs to the phenotypic variance of a trait (disease) and the proportion of genetic overlap between traits (diseases). A frequently asked question is that how many samples are required to estimate the proportion of variance attributable to all SNPs and the proportion of genetic overlap with useful precision. In this study, we derive the standard errors of the estimated parameters from theory and find that they are highly consistent with those observed values from published results and those obtained from simulation. The theory together with an online application tool will be helpful to plan experimental design to quantify the missing heritability, and to estimate the genetic overlap between traits (diseases) especially when it is unfeasible to have the traits (diseases) measured on the same individuals.
Vyšlo v časopise:
Statistical Power to Detect Genetic (Co)Variance of Complex Traits Using SNP Data in Unrelated Samples. PLoS Genet 10(4): e32767. doi:10.1371/journal.pgen.1004269
Kategorie:
Research Article
prolekare.web.journal.doi_sk:
https://doi.org/10.1371/journal.pgen.1004269
Souhrn
Genome-wide association studies (GWAS) have identified thousands of genetic variants for hundreds of traits and diseases. However, the genetic variants discovered from GWAS only explained a small fraction of the heritability, resulting in the question of “missing heritability”. We have recently developed approaches (called GREML) to estimate the overall contribution of all SNPs to the phenotypic variance of a trait (disease) and the proportion of genetic overlap between traits (diseases). A frequently asked question is that how many samples are required to estimate the proportion of variance attributable to all SNPs and the proportion of genetic overlap with useful precision. In this study, we derive the standard errors of the estimated parameters from theory and find that they are highly consistent with those observed values from published results and those obtained from simulation. The theory together with an online application tool will be helpful to plan experimental design to quantify the missing heritability, and to estimate the genetic overlap between traits (diseases) especially when it is unfeasible to have the traits (diseases) measured on the same individuals.
Zdroje
1. HindorffLA, SethupathyP, JunkinsHA, RamosEM, MehtaJP, et al. (2009) Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci USA 106: 9362–9367.
2. ManolioTA, CollinsFS, CoxNJ, GoldsteinDB, HindorffLA, et al. (2009) Finding the missing heritability of complex diseases. Nature 461: 747–753.
3. YangJ, BenyaminB, McEvoyBP, GordonS, HendersAK, et al. (2010) Common SNPs explain a large proportion of the heritability for human height. Nat Genet 42: 565–569.
4. LeeSH, WrayNR, GoddardME, VisscherPM (2011) Estimating missing heritability for disease from genome-wide association studies. Am J Hum Genet 88: 294–305.
5. DearyIJ, YangJ, DaviesG, HarrisSE, TenesaA, et al. (2012) Genetic contributions to stability and change in intelligence from childhood to old age. Nature 482: 212–215.
6. LeeSH, YangJ, GoddardME, VisscherPM, WrayNR (2012) Estimation of pleiotropy between complex diseases using single-nucleotide polymorphism-derived genomic relationships and restricted maximum likelihood. Bioinformatics 28: 2540–2542.
7. LeeSH, RipkeS, NealeBM, FaraoneSV, PurcellSM, et al. (2013) Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs. Nat Genet 45: 984–994.
8. SoHC, LiM, ShamPC (2011) Uncovering the total heritability explained by all true susceptibility variants in a genome-wide association study. Genet Epidemiol 35: 447–456.
9. DudbridgeF (2013) Power and predictive accuracy of polygenic risk scores. PLoS Genet 9: e1003348.
10. StahlEA, WegmannD, TrynkaG, Gutierrez-AchuryJ, DoR, et al. (2012) Bayesian inference analyses of the polygenic architecture of rheumatoid arthritis. Nat Genet 44: 483–489.
11. BenjaminDJ, CesariniD, van der LoosMJ, DawesCT, KoellingerPD, et al. (2012) The genetic architecture of economic and political preferences. Proc Natl Acad Sci U S A 109: 8026–8031.
12. MeuwissenTH, HayesBJ, GoddardME (2001) Prediction of total genetic value using genome-wide dense marker maps. Genetics 157: 1819–1829.
13. YangJ, LeeSH, GoddardME, VisscherPM (2011) GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet 88: 76–82.
14. PattersonHD, ThompsonR (1971) Recovery of inter-block information when block sizes are unequal. Biometrika 58: 545–554.
15. Lynch M, Walsh B (1998) Genetics and analysis of quantitative traits: Sunderland, MA: Sinauer Associates.
16. WilliamsJT, BlangeroJ (1999) Power of variance component linkage analysis to detect quantitative trait loci. Ann Hum Genet 63: 545–563.
17. ShamPC, ChernySS, PurcellS, HewittJK (2000) Power of linkage versus association analysis of quantitative traits, by use of variance-components models, for sibship data. Am J Hum Genet 66: 1616–1630.
18. RijsdijkFV, HewittJK, ShamPC (2001) Analytic power calculation for QTL linkage analysis of small pedigrees. Eur J Hum Genet 9: 335–340.
19. ShamPC, PurcellS (2001) Equivalence between Haseman-Elston and variance-components linkage analyses for sib pairs. Am J Hum Genet 68: 1527–1532.
20. VisscherPM, HopperJL (2001) Power of regression and maximum likelihood methods to map QTL from sib-pair and DZ twin data. Ann Hum Genet 65: 583–601.
21. VinkhuyzenAA, WrayNR, YangJ, GoddardME, VisscherPM (2013) Estimation and Partitioning of Heritability in Human Populations Using Whole Genome Analysis Methods. Annual Review of Genetics 47.
22. YangJ, ManolioTA, PasqualeLR, BoerwinkleE, CaporasoN, et al. (2011) Genome partitioning of genetic variation for complex traits using common SNPs. Nat Genet 43: 519–525.
23. ReeveECR (1955) The variance of the genetic correlation coeffi- cient. Biometrics 11: 357–374.
24. RobertsonA (1959) The sampling variance of the genetic correlation coefficient. Biometrics 15: 469–485.
25. KootsKR, GibsonJP (1996) Realized sampling variances of estimates of genetic parameters and the difference between genetic and phenotypic correlations. Genetics 143: 1409–1416.
26. Falconer DS, Mackay TFC (1996) Introduction to quantitative genetics: England: Longman.
27. ChenWM, AbecasisGR (2006) Estimating the power of variance component linkage analysis in large pedigrees. Genet Epidemiol 30: 471–484.
28. GoddardME (2009) Genomic selection: prediction of accuracy and maximisation of long term. Genetica 136: 245–257.
29. SpeedD, HemaniG, JohnsonMR, BaldingDJ (2012) Improved heritability estimation from genome-wide SNPs. Am J Hum Genet 91: 1011–1021.
30. VisscherPM, YangJ, GoddardME (2010) A Commentary on ‘Common SNPs Explain a Large Proportion of the Heritability for Human Height’ by Yang et al. (2010). Twin Res Hum Genet 13: 517–524.
31. WrayNR, YangJ, HayesBJ, PriceAL, GoddardME, et al. (2013) Author reply to A commentary on Pitfalls of predicting complex traits from SNPs. Nat Rev Genet 14: 894.
32. DevlinB, RoederK (1999) Genomic control for association studies. Biometrics 55: 997–1004.
33. VisscherPM (1998) On the sampling variance of intraclass correlations and genetic correlations. Genetics 149: 1605–1614.
34. LeeSH, HaroldD, NyholtDR, GoddardME, ZondervanKT, et al. (2013) Estimation and partitioning of polygenic variation captured by common SNPs for Alzheimer's disease, multiple sclerosis and endometriosis. Hum Mol Genet 22: 832–841.
Štítky
Genetika Reprodukčná medicínaČlánok vyšiel v časopise
PLOS Genetics
2014 Číslo 4
- Je „freeze-all“ pro všechny? Odborníci na fertilitu diskutovali na virtuálním summitu
- Gynekologové a odborníci na reprodukční medicínu se sejdou na prvním virtuálním summitu
Najčítanejšie v tomto čísle
- The Sequence-Specific Transcription Factor c-Jun Targets Cockayne Syndrome Protein B to Regulate Transcription and Chromatin Structure
- Genetic Predisposition to In Situ and Invasive Lobular Carcinoma of the Breast
- Widespread Use of Non-productive Alternative Splice Sites in
- RNA Editome in Rhesus Macaque Shaped by Purifying Selection