#PAGE_PARAMS# #ADS_HEAD_SCRIPTS# #MICRODATA#

A Flexible Bayesian Model for Studying Gene–Environment Interaction


An important follow-up step after genetic markers are found to be associated with a disease outcome is a more detailed analysis investigating how the implicated gene or chromosomal region and an established environment risk factor interact to influence the disease risk. The standard approach to this study of gene–environment interaction considers one genetic marker at a time and therefore could misrepresent and underestimate the genetic contribution to the joint effect when one or more functional loci, some of which might not be genotyped, exist in the region and interact with the environment risk factor in a complex way. We develop a more global approach based on a Bayesian model that uses a latent genetic profile variable to capture all of the genetic variation in the entire targeted region and allows the environment effect to vary across different genetic profile categories. We also propose a resampling-based test derived from the developed Bayesian model for the detection of gene–environment interaction. Using data collected in the Environment and Genetics in Lung Cancer Etiology (EAGLE) study, we apply the Bayesian model to evaluate the joint effect of smoking intensity and genetic variants in the 15q25.1 region, which contains a cluster of nicotinic acetylcholine receptor genes and has been shown to be associated with both lung cancer and smoking behavior. We find evidence for gene–environment interaction (P-value = 0.016), with the smoking effect appearing to be stronger in subjects with a genetic profile associated with a higher lung cancer risk; the conventional test of gene–environment interaction based on the single-marker approach is far from significant.


Vyšlo v časopise: A Flexible Bayesian Model for Studying Gene–Environment Interaction. PLoS Genet 8(1): e32767. doi:10.1371/journal.pgen.1002482
Kategorie: Research Article
prolekare.web.journal.doi_sk: https://doi.org/10.1371/journal.pgen.1002482

Souhrn

An important follow-up step after genetic markers are found to be associated with a disease outcome is a more detailed analysis investigating how the implicated gene or chromosomal region and an established environment risk factor interact to influence the disease risk. The standard approach to this study of gene–environment interaction considers one genetic marker at a time and therefore could misrepresent and underestimate the genetic contribution to the joint effect when one or more functional loci, some of which might not be genotyped, exist in the region and interact with the environment risk factor in a complex way. We develop a more global approach based on a Bayesian model that uses a latent genetic profile variable to capture all of the genetic variation in the entire targeted region and allows the environment effect to vary across different genetic profile categories. We also propose a resampling-based test derived from the developed Bayesian model for the detection of gene–environment interaction. Using data collected in the Environment and Genetics in Lung Cancer Etiology (EAGLE) study, we apply the Bayesian model to evaluate the joint effect of smoking intensity and genetic variants in the 15q25.1 region, which contains a cluster of nicotinic acetylcholine receptor genes and has been shown to be associated with both lung cancer and smoking behavior. We find evidence for gene–environment interaction (P-value = 0.016), with the smoking effect appearing to be stronger in subjects with a genetic profile associated with a higher lung cancer risk; the conventional test of gene–environment interaction based on the single-marker approach is far from significant.


Zdroje

1. HindorffLAJunkinsHAHallPNMehtaJPManolioTA 2011 A catalog of published genome-wide association studies. Available at: www.genome.gov/gwastudies. Accessed August, 2011

2. LindstromSSchumacherFSiddiqATravisRCCampaD 2011 Characterizing associations and SNP-environment interactions for GWAS-identified prostate cancer risk markers-Results from BPC3. PLoS ONE 6 e17142 doi:10.1371/journal.pone.0017142

3. RothmanNGarcia-ClosasMChatterjeeNMalatsNWuX 2010 A multi-stage genome-wide association study of bladder cancer identifies multiple susceptibility loci. Nat Genet 42 978 984

4. SpitzMRAmosCIDongQLinJWuX 2008 The CHRNA5-A3 region on chromosome 15q24–25.1 is a risk factor both for nicotine dependence and for lung cancer. J Natl Cancer Inst 100 1552 1556

5. MooreJHAsselbergsFWWilliamsSM 2010 Bioinformatics challenges for genome-wide association studies. Bioinformatics 26 445 455

6. GreenPRichardsonS 2002 Hidden Markov models and disease mapping. J Am Stat Assoc 97 1055 1070

7. PottsRB 1952 Some generalized order-disorder transformations. Cambridge Philos Soc Math Proc 48 106 109

8. ThomasDCStramDOContiDMolitorJMarjoramP 2003 Bayesian spatial modeling of haplotype associations. Hum Hered 56 32 40

9. MoltchanovaEVPitkaniemiJHaapalaL 2005 Potts model for haplotype associations. BMC Genet 6 Suppl 1 S64

10. LiuJS 2002 Monte Carlo Strategies in Scientific Computing New York Springer

11. RobertCPCasellaG 1999 Monte Carlo Statistical Methods New York Springer

12. LiangFLiuCCarrollRJ 2010 Advanced Markov Chain Monte Carlo Methods: Learning from Past Samples Wiley

13. LiangF 2008 Clustering gene expression profiles using mixture model ensemble averaging approach. JP J Biostat 2 57 80

14. MolitorJParathomasMJerrettMRichardsonS 2010 Bayesian profile regrression with an application to the national survey of children's health. Biostatistics 11 484 498

15. LandiMTChatterjeeNYuKGoldinLRGoldsteinAM 2009 A genome-wide association study of lung cancer identifies a region of chromosome 5p15 associated with risk for adenocarcinoma. Am J Hum Genet 85 679 691

16. AmosCIWuXBroderickPGorlovIPGuJ 2008 Genome-wide association scan of tag SNPs identifies a susceptibility locus for lung cancer at 15q25.1. Nat Genet 40 616 622

17. ThorgeirssonTEGellerFSulemPRafnarTWisteA 2008 A variant associated with nicotine dependence, lung cancer and peripheral arterial disease. Nature 452 638 642

18. ThorgeirssonTEGudbjartssonDFSurakkaIVinkJMAminN 2010 Sequence variants at CHRNB3-CHRNA6 and CYP2A6 affect smoking behavior. Nat Genet 42 448 453

19. SacconeNLCulverhouseRCSchwantes-AnTHCannonDSChenX 2010 Multiple independent loci at chromosome 15q25.1 affect smoking quantity: a meta-analysis and comparison with lung cancer and COPD. PLoS Genet 6 e1001053 doi:10.1371/journal.pgen.1001053

20. LiuJZTozziFWaterworthDMPillaiSGMugliaP 2010 Meta-analysis and imputation refines the association of 15q25 with smoking quantity. Nat Genet 42 436 440

21. ConsortiumTaG 2010 Genome-wide meta-analyses identify multiple loci associated with smoking behavior. Nat Genet 42 441 447

22. CaporasoNGuFChatterjeeNSheng-ChihJYuK 2009 Genome-wide and candidate gene association study of cigarette smoking behaviors. PLoS ONE 4 e4653 doi:10.1371/journal.pone.0004653

23. StaicuA 2010 On the equivalence of prospective and retrospective likelihood methods in case-control studies. Biometrika 97 990 996

24. SeamanSRRichardsonS 2001 Bayesian analysis of case-control studies with categorical covariates. Biometrika 88 1073 1088

25. BorgsCChayesJTFriezeAKimJHTetaliP Torpid mixing of some Monte Carlo Markov chain algorithms in statistical physics; 1999; Washington, DC

26. MillerP 1993 Alternative to the Gibbs sampling scheme. Tech. Report, Institute of Statistics and Decision Science

27. OgataYTanemuraM 1984 Likelihood analysis of spatial point patterns. J Royal Stat Soc, Ser B 46 496 518

28. SpiegelhalterDJBestNGCarlinBPvan der LindeA 2002 Bayesian measures of model complexity and fit (with discussion). J R Stat Soc Ser B 64 583 639

29. BreimanLFriedmanJHOlshenRAStoneCJ 1984 Classification and Regression Trees Monterey Wadsworth and Brooks/Cole

30. KaufmanLRousseeuwPJ 2005 Finding Groups in Data: An Introduction to Cluster Analysis Hoboken, NJ Wiley-Interscience

31. EfronBTibshiraniRJ 1993 An Introduction to the Bootstrap New York Chapman & Hall

32. YuKLiQBergenAWPfeifferRMRosenbergPS 2009 Pathway analysis by adaptive combination of P-values. Genet Epidemiol 33 700 709

33. GelmanARubinDB 1992 Inference from iterative simulation using multiple sequences. Stat Sci 7 457 511

34. PlummerMBestNCowlesKVinesK 2006 CODA: Convergence diagnosis and output analysis for MCMC. R News 6 7 11

35. Fields Development Team 2006 Fields: Tools for Spatial Data. National Center for Atmospheric Research. Boulder, CO

36. Garcia-ClosasMRothmanNLubinJ 1999 Misclassification in case-control studies of gene–environment interactions: assessment of bias and sample size. Cancer Epidemiol Biomarkers Prev 8 1043 1050

37. HayesRBSigurdsonAMooreLPetersUHuangWY 2005 Methods for etiologic and early marker investigations in the PLCO trial. Mutat Res 592 147 154

38. GreenP 1995 Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika 82 711 732

39. SeamanSRRichardsonS 2004 Equivalence of prospective and restrospective models in the Bayesian analysis of case-control studies. Biometrika 91 15 25

40. CostainDA 2009 Bayesian partitioning for modeling and mapping spatial case-control data. Biometrics 65 1123 1132

41. RafteryAERichardsonS 1996 Model selection for generalized linear models via GLIB, with application to epidemiology. BerryDAStanglDK Bayesian Biostatistics New York Marcel Dekker 321 354

42. TangWWuXJiangRLiY 2009 Epistatic module detection for case-control studies: a Bayesian model with a Gibbs sampling strategy. PLoS Genet 5 e1000464 doi:10.1371/journal.pgen.1000464

43. ChatterjeeNKalayliogluZMoslehiRPetersUWacholderS 2006 Powerful multilocus tests of genetic association in the presence of gene-gene and gene–environment interactions. Am J Hum Genet 79 1002 1016

Štítky
Genetika Reprodukčná medicína

Článok vyšiel v časopise

PLOS Genetics


2012 Číslo 1
Najčítanejšie tento týždeň
Najčítanejšie v tomto čísle
Prihlásenie
Zabudnuté heslo

Zadajte e-mailovú adresu, s ktorou ste vytvárali účet. Budú Vám na ňu zasielané informácie k nastaveniu nového hesla.

Prihlásenie

Nemáte účet?  Registrujte sa

#ADS_BOTTOM_SCRIPTS#