Quantifying the Underestimation of Relative Risks from Genome-Wide Association Studies
Genome-wide association studies (GWAS) have identified hundreds of associated loci across many common diseases. Most risk variants identified by GWAS will merely be tags for as-yet-unknown causal variants. It is therefore possible that identification of the causal variant, by fine mapping, will identify alleles with larger effects on genetic risk than those currently estimated from GWAS replication studies. We show that under plausible assumptions, whilst the majority of the per-allele relative risks (RR) estimated from GWAS data will be close to the true risk at the causal variant, some could be considerable underestimates. For example, for an estimated RR in the range 1.2–1.3, there is approximately a 38% chance that it exceeds 1.4 and a 10% chance that it is over 2. We show how these probabilities can vary depending on the true effects associated with low-frequency variants and on the minor allele frequency (MAF) of the most associated SNP. We investigate the consequences of the underestimation of effect sizes for predictions of an individual's disease risk and interpret our results for the design of fine mapping experiments. Although these effects mean that the amount of heritability explained by known GWAS loci is expected to be larger than current projections, this increase is likely to explain a relatively small amount of the so-called “missing” heritability.
Vyšlo v časopise:
Quantifying the Underestimation of Relative Risks from Genome-Wide Association Studies. PLoS Genet 7(3): e32767. doi:10.1371/journal.pgen.1001337
Kategorie:
Research Article
prolekare.web.journal.doi_sk:
https://doi.org/10.1371/journal.pgen.1001337
Souhrn
Genome-wide association studies (GWAS) have identified hundreds of associated loci across many common diseases. Most risk variants identified by GWAS will merely be tags for as-yet-unknown causal variants. It is therefore possible that identification of the causal variant, by fine mapping, will identify alleles with larger effects on genetic risk than those currently estimated from GWAS replication studies. We show that under plausible assumptions, whilst the majority of the per-allele relative risks (RR) estimated from GWAS data will be close to the true risk at the causal variant, some could be considerable underestimates. For example, for an estimated RR in the range 1.2–1.3, there is approximately a 38% chance that it exceeds 1.4 and a 10% chance that it is over 2. We show how these probabilities can vary depending on the true effects associated with low-frequency variants and on the minor allele frequency (MAF) of the most associated SNP. We investigate the consequences of the underestimation of effect sizes for predictions of an individual's disease risk and interpret our results for the design of fine mapping experiments. Although these effects mean that the amount of heritability explained by known GWAS loci is expected to be larger than current projections, this increase is likely to explain a relatively small amount of the so-called “missing” heritability.
Zdroje
1. ManolioTA
BrooksLD
CollinsFS
2008 A HapMap harvest of insights into the genetics of common disease. J Clin Invest 118 1590 1605
2. FrazerKA
BallingerDG
CoxDR
HindsDA
StuveLL
2007 A second generation human haplotype map of over 3.1 million SNPs. Nature 449 851 861
3. MarchiniJ
HowieB
MyersS
McVeanG
DonnellyP
2007 A new multipoint method for genome-wide association studies by imputation of genotypes. Nat Genet 39 906 913
4. ZollnerS
PritchardJK
2007 Overcoming the winner's curse: estimating penetrance parameters from case-control data. Am J Hum Genet 80 605 615
5. IlesMM
2008 What can genome-wide association studies tell us about the genetics of common disease? PLoS Genet 4 e33 doi:10.1371/journal.pgen.0040033
6. 2005 A haplotype map of the human genome. Nature 437 1299 1320
7. WakefieldJ
2008 Bayes factors for genome-wide association studies: comparison with P-values. Genet Epidemiol 33 79 86
8. WangWY
BarrattBJ
ClaytonDG
ToddJA
2005 Genome-wide association studies: theoretical and practical concerns. Nat Rev Genet 6 109 118
9. PritchardJK
2001 Are rare variants responsible for susceptibility to complex diseases? Am J Hum Genet 69 124 137
10. ZondervanKT
CardonLR
2004 The complex interplay among factors that influence allelic association. Nat Rev Genet 5 89 100
11. BarrettJC
HansoulS
NicolaeDL
ChoJH
DuerrRH
2008 Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease. Nat Genet 40 955 962
12. ZegginiE
ScottLJ
SaxenaR
VoightBF
MarchiniJL
2008 Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes. Nat Genet 40 638 645
13. PharoahPD
AntoniouAC
EastonDF
PonderBA
2008 Polygenes, risk prediction, and targeted prevention of breast cancer. N Engl J Med 358 2796 2803
14. RischN
1990 Linkage strategies for genetically complex traits. II. The power of affected relative pairs. Am J Hum Genet 46 229 241
15. ThomasDC
2004 Statistical methods in genetic epidemiology Oxford Oxford University Press xxi, 435
16. PharoahPD
DayNE
DuffyS
EastonDF
PonderBA
1997 Family history and the risk of breast cancer: a systematic review and meta-analysis. Int J Cancer 71 800 809
17. WeijnenCF
RichSS
MeigsJB
KrolewskiAS
WarramJH
2002 Risk of diabetes in siblings of index cases with Type 2 diabetes: implications for genetic studies. Diabet Med 19 41 50
18. OrholmM
MunkholmP
LangholzE
NielsenOH
SorensenTI
1991 Familial occurrence of inflammatory bowel disease. N Engl J Med 324 84 88
19. The Wellcome Trust Case Control Consortium 2007 Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447 661 678
20. MaherB
2008 Personal genomes: The case of the missing heritability. Nature 456 18 21
21. ClaytonDG
2009 Prediction and interaction in complex disease genetics: experience in type 1 diabetes. PLoS Genet 5 e1000540 doi:10.1371/journal.pgen.1000540
22. WallaceC
ClaytonD
2003 Estimating the relative recurrence risk ratio using a global cross-ratio model. Genet Epidemiol 25 293 302
23. 2004 The ENCODE (ENCyclopedia Of DNA Elements) Project. Science 306 636 640
24. 2003 The International HapMap Project. Nature 426 789 796
25. MyersS
BottoloL
FreemanC
McVeanG
DonnellyP
2005 A fine-scale map of recombination rates and hotspots across the human genome. Science 310 321 324
26. ArmitageP
1955 Tests for linear trends in proportions and frequencies. Biometrics 11 375 386
27. SchoutenEG
DekkerJM
KokFJ
Le CessieS
Van HouwelingenHC
1993 Risk ratio and rate ratio estimation in case-cohort designs: hypertension and cardiovascular mortality. Stat Med 12 1733 1745
28. StephensM
BaldingDJ
2009 Bayesian statistical methods for genetic association studies. Nat Rev Genet 10 681 690
Štítky
Genetika Reprodukčná medicínaČlánok vyšiel v časopise
PLOS Genetics
2011 Číslo 3
- Je „freeze-all“ pro všechny? Odborníci na fertilitu diskutovali na virtuálním summitu
- Gynekologové a odborníci na reprodukční medicínu se sejdou na prvním virtuálním summitu
Najčítanejšie v tomto čísle
- Whole-Exome Re-Sequencing in a Family Quartet Identifies Mutations As the Cause of a Novel Skeletal Dysplasia
- Origin-Dependent Inverted-Repeat Amplification: A Replication-Based Model for Generating Palindromic Amplicons
- FUS Transgenic Rats Develop the Phenotypes of Amyotrophic Lateral Sclerosis and Frontotemporal Lobar Degeneration
- Limited dCTP Availability Accounts for Mitochondrial DNA Depletion in Mitochondrial Neurogastrointestinal Encephalomyopathy (MNGIE)