Proteins Encoded in Genomic Regions Associated with Immune-Mediated Disease Physically Interact and Suggest Underlying Biology
Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these risk variants. It has previously been observed that different genes harboring causal mutations for the same Mendelian disease often physically interact. We sought to evaluate the degree to which this is true of genes within strongly associated loci in complex disease. Using sets of loci defined in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein–protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more densely connected than chance expectation. To confirm biological relevance, we show that the components of the networks tend to be expressed in similar tissues relevant to the phenotypes in question, suggesting the network indicates common underlying processes perturbed by risk loci. Furthermore, we show that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated to height and lipid levels assemble into significantly connected networks but did not detect excess connectivity among Type 2 Diabetes (T2D) loci beyond chance. Taken together, our results constitute evidence that, for many of the complex diseases studied here, common genetic associations implicate regions encoding proteins that physically interact in a preferential manner, in line with observations in Mendelian disease.
Vyšlo v časopise:
Proteins Encoded in Genomic Regions Associated with Immune-Mediated Disease Physically Interact and Suggest Underlying Biology. PLoS Genet 7(1): e32767. doi:10.1371/journal.pgen.1001273
Kategorie:
Research Article
prolekare.web.journal.doi_sk:
https://doi.org/10.1371/journal.pgen.1001273
Souhrn
Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these risk variants. It has previously been observed that different genes harboring causal mutations for the same Mendelian disease often physically interact. We sought to evaluate the degree to which this is true of genes within strongly associated loci in complex disease. Using sets of loci defined in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein–protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more densely connected than chance expectation. To confirm biological relevance, we show that the components of the networks tend to be expressed in similar tissues relevant to the phenotypes in question, suggesting the network indicates common underlying processes perturbed by risk loci. Furthermore, we show that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated to height and lipid levels assemble into significantly connected networks but did not detect excess connectivity among Type 2 Diabetes (T2D) loci beyond chance. Taken together, our results constitute evidence that, for many of the complex diseases studied here, common genetic associations implicate regions encoding proteins that physically interact in a preferential manner, in line with observations in Mendelian disease.
Zdroje
1. RaychaudhuriS
ThomsonBP
RemmersEF
EyreS
HinksA
2009 Genetic variants at CD28, PRDM1 and CD2/CD58 are associated with rheumatoid arthritis risk. Nat Genet 41 1313 1318
2. BarrettJC
HansoulS
NicolaeDL
ChoJH
DuerrRH
2008 Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease. Nat Genet 40 955 962
3. BarrettJC
ClaytonDG
ConcannonP
AkolkarB
CooperJD
2009 Genome-wide association study and meta-analysis find that over 40 loci affect risk of type 1 diabetes. Nat Genet Available at: http://www.ncbi.nlm.nih.gov.ezp-prod1.hul.harvard.edu/pubmed/19430480. Accessed 19 March 2010
4. BarrettJC
LeeJC
LeesCW
PrescottNJ
AndersonCA
2009 Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region. Nat Genet 41 1330 1334
5. De JagerPL
JiaX
WangJ
de BakkerPIW
OttoboniL
2009 Meta-analysis of genome scans and replication identify CD6, IRF8 and TNFRSF1A as new multiple sclerosis susceptibility loci. Nat Genet 41 776 782
6. GatevaV
SandlingJK
HomG
TaylorKE
ChungSA
2009 A large-scale replication study identifies TNIP1, PRDM1, JAZF1, UHRF1BP1 and IL10 as risk loci for systemic lupus erythematosus. Nat Genet 41 1228 1233
7. HuntKA
ZhernakovaA
TurnerG
HeapGAR
FrankeL
2008 Newly identified genetic risk variants for celiac disease related to the immune response. Nat Genet 40 395 402
8. RaychaudhuriS
2010 Recent advances in the genetics of rheumatoid arthritis. Curr Opin Rheumatol 22 109 118
9. DupuisJ
LangenbergC
ProkopenkoI
SaxenaR
SoranzoN
2010 New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk. Nat Genet 42 105 116
10. GudbjartssonDF
WaltersGB
ThorleifssonG
StefanssonH
HalldorssonBV
2008 Many sequence variants affecting diversity of adult human height. Nat Genet 40 609 615
11. KathiresanS
MelanderO
GuiducciC
SurtiA
BurttNP
2008 Six new loci associated with blood low-density lipoprotein cholesterol, high-density lipoprotein cholesterol or triglycerides in humans. Nat Genet 40 189 197
12. McCarthyMI
ZegginiE
2009 Genome-wide association studies in type 2 diabetes. Curr Diab Rep 9 164 171
13. LettreG
JacksonAU
GiegerC
SchumacherFR
BerndtSI
2008 Identification of ten loci associated with height highlights new biological pathways in human growth. Nat Genet 40 584 591
14. VoightBF
ScottLJ
SteinthorsdottirV
MorrisAP
DinaC
2010 Twelve type 2 diabetes susceptibility loci identified through large-scale association analysis. Nat Genet 42 579 589
15. WillerCJ
SannaS
JacksonAU
ScuteriA
BonnycastleLL
2008 Newly identified loci that influence lipid concentrations and risk of coronary artery disease. Nat Genet 40 161 169
16. WeedonMN
LangoH
LindgrenCM
WallaceC
EvansDM
2008 Genome-wide association analysis identifies 20 loci that influence adult height. Nat Genet 40 575 583
17. ZegginiE
ScottLJ
SaxenaR
VoightBF
MarchiniJL
2008 Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes. Nat Genet 40 638 645
18. ZhangX
HuangW
YangS
SunL
ZhangF
2009 Psoriasis genome-wide association study identifies susceptibility variants within LCE gene cluster at 1q21. Nat Genet 41 205 210
19. RaychaudhuriS
PlengeRM
RossinEJ
NgACY
PurcellSM
2009 Identifying relationships among genomic disease regions: predicting genes at pathogenic SNP associations and rare deletions. PLoS Genet 5 e1000534 doi:10.1371/journal.pgen.1000534
20. WangK
LiM
BucanM
2007 Pathway-Based Approaches for Analysis of Genomewide Association Studies. Am J Hum Genet 81 Available at: http://www.ncbi.nlm.nih.gov/pubmed/17966091. Accessed 3 March 2010
21. SubramanianA
TamayoP
MoothaVK
MukherjeeS
EbertBL
2005 Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 102 15545 15550
22. BrunnerHG
van DrielMA
2004 From syndrome families to functional genomics. Nat Rev Genet 5 545 551
23. D'AndreaAD
GrompeM
2003 The Fanconi anaemia/BRCA pathway. Nat Rev Cancer 3 23 34
24. LageK
KarlbergEO
StørlingZM
OlasonPI
PedersenAG
2007 A human phenome-interactome network of protein complexes implicated in genetic disorders. Nat Biotechnol 25 309 316
25. LimJ
HaoT
ShawC
PatelAJ
SzabóG
2006 A Protein-Protein Interaction Network for Human Inherited Ataxias and Disorders of Purkinje Cell Degeneration. Cell 125 801 814
26. WalhoutAJ
ReboulJ
ShtankoO
BertinN
VaglioP
2002 Integrating Interactome, Phenome, and Transcriptome Mapping Data for the C. elegans Germline. Current Biology 12 1952 1958
27. LiL
ZhangK
LeeJ
CordesS
DavisDP
2009 Discovering cancer genes by integrating network and functional properties. BMC Med Genomics 2 61
28. SenguptaU
UkilS
DimitrovaN
AgrawalS
2009 Expression-based network biology identifies alteration in key regulatory pathways of type 2 diabetes and associated risk/complications. PLoS ONE 4 e8100 doi:10.1371/journal.pone.0008100
29. FrankeL
van BakelH
FokkensL
de JongED
Egmont-PetersenM
2006 Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes. Am J Hum Genet 78 1011 1025
30. GentlemanR
HuberW
2007 Making the most of high-throughput protein-interaction data. Genome Biol 8 112
31. LageK
HansenNT
KarlbergEO
EklundAC
RoqueFS
2008 A large-scale analysis of tissue-specific pathology and gene expression of human disease genes and complexes. Proc Natl Acad Sci U S A 105 20870 20875
32. HuhW
FalvoJV
GerkeLC
CarrollAS
HowsonRW
2003 Global analysis of protein localization in budding yeast. Nature 425 686 691
33. ZieglerA
NepomGT
2010 Prediction and pathogenesis in type 1 diabetes. Immunity 32 468 478
34. BergholdtR
StørlingZM
LageK
KarlbergEO
OlasonPI
2007 Integrative analysis for finding genes and networks involved in diabetes and other complex diseases. Genome Biol 8 R253
35. WuG
ZhuL
DentJE
NardiniC
2010 A comprehensive molecular interaction map for rheumatoid arthritis. PLoS ONE 5 e10137 doi:10.1371/journal.pone.0010137
36. MoldovanG
D'AndreaAD
2009 How the fanconi anemia pathway guards the genome. Annu Rev Genet 43 223 249
37. StahlEA
RaychaudhuriS
RemmersEF
XieG
EyreS
2010 Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci. Nat Genet Available at: http://www.ncbi.nlm.nih.gov.ezp-prod1.hul.harvard.edu/pubmed/20453842. Accessed 18 May 2010
38. FiresteinGS
2003 Evolving concepts of rheumatoid arthritis. Nature 423 356 361
39. AbrahamC
ChoJH
2009 Inflammatory Bowel Disease. N Engl J Med 361 2066 2078
40. AbrahamC
ChoJ
2009 Interleukin-23/Th17 pathways and inflammatory bowel disease. Inflamm Bowel Dis 15 1090 1100
41. BrandS
2009 Crohn's disease: Th1, Th17 or both? The change of a paradigm: new immunological and genetic insights implicate Th17 cells in the pathogenesis of Crohn's disease. Gut 58 1152 1167
42. ChoJH
2008 The genetics and immunopathogenesis of inflammatory bowel disease. Nat Rev Immunol 8 458 466
43. CriswellLA
2010 Gene discovery in rheumatoid arthritis highlights the CD40/NF-kappaB signaling pathway in disease pathogenesis. Immunol Rev 233 55 61
44. TakedaK
ClausenBE
KaishoT
TsujimuraT
TeradaN
1999 Enhanced Th1 Activity and Development of Chronic Enterocolitis in Mice Devoid of Stat3 in Macrophages and Neutrophils. Immunity 10 39 49
45. ZhangH
MasseyD
TremellingM
ParkesM
2008 Genetics of inflammatory bowel disease: clues to pathogenesis. Br Med Bull 87 17 30
46. BenitaY
CaoZ
GiallourakisC
LiC
GardetA
2010 Gene enrichment profiles reveal T cell development, differentiation and lineage specific transcription factors including ZBTB25 as a novel NF-AT repressor. Blood Available at: http://www.ncbi.nlm.nih.gov.ezp-prod1.hul.harvard.edu/pubmed/20410506. Accessed 8 May 2010
47. FrankeA
McGovernDPB
BarrettJC
WangK
Radford-SmithGL
2010 Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci. Nat Genet 42 1118 1125
48. LeeEG
BooneDL
ChaiS
LibbySL
ChienM
2000 Failure to Regulate TNF-Induced NF-kappa B and Cell Death Responses in A20-Deficient Mice. Science 289 2350 2354
49. MunroeME
BishopGA
2007 A Costimulatory Function for T Cell CD40. J Immunol 178 671 682
50. BottiniN
VangT
CuccaF
MustelinT
2006 Role of PTPN22 in type 1 diabetes and other autoimmune diseases. Semin Immunol 18 207 213
51. SmythDJ
PlagnolV
WalkerNM
CooperJD
DownesK
2008 Shared and distinct genetic variants in type 1 diabetes and celiac disease. N Engl J Med 359 2767 2777
52. KanoS
SatoK
MorishitaY
VollstedtS
KimS
2008 The contribution of transcription factor IRF1 to the interferon-gamma-interleukin 12 signaling axis and TH1 versus TH-17 differentiation of CD4+ T cells. Nat Immunol 9 34 41
53. BaderGD
BetelD
HogueCWV
2003 BIND: the Biomolecular Interaction Network Database. Nucleic Acids Res 31 248 250
54. Chatr-aryamontriA
CeolA
PalazziLM
NardelliG
SchneiderMV
2007 MINT: the Molecular INTeraction database. Nucleic Acids Res 35 D572 574
55. Keshava PrasadTS
GoelR
KandasamyK
KeerthikumarS
KumarS
2009 Human Protein Reference Database–2009 update. Nucleic Acids Res 37 D767 772
56. BreitkreutzB
StarkC
RegulyT
BoucherL
BreitkreutzA
2008 The BioGRID Interaction Database: 2008 update. Nucleic Acids Res 36 D637 640
57. XenariosI
SalwínskiL
DuanXJ
HigneyP
KimS
2002 DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions. Nucleic Acids Res 30 303 305
58. MewesHW
DietmannS
FrishmanD
GregoryR
MannhauptG
2008 MIPS: analysis and annotation of genome information in 2007. Nucleic Acids Res 36 D196 201
59. KanehisaM
GotoS
2000 KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28 27 30
60. ArandaB
AchuthanP
Alam-FaruqueY
ArmeanI
BridgeA
2010 The IntAct molecular interaction database in 2010. Nucleic Acids Res 38 D525 531
61. D'EustachioP
2011 Reactome knowledgebase of human biological pathways and processes. Methods Mol Biol 694 49 61
62. The International HapMap Consortium 2005 A haplotype map of the human genome. Nature 437 1299 1320
63. FujitaPA
RheadB
ZweigAS
HinrichsAS
KarolchikD
2010 The UCSC Genome Browser database: update 2011. Nucleic Acids Res Available at: http://www.ncbi.nlm.nih.gov.ezp-prod1.hul.harvard.edu/pubmed/20959295. Accessed 7 December 2010
64. VeyrierasJ
KudaravalliS
KimSY
DermitzakisET
GiladY
2008 High-Resolution Mapping of Expression-QTLs Yields Insight into Human Gene Regulation. PLoS Genet 4 e1000214 doi:10.1371/journal.pgen.1000214
Štítky
Genetika Reprodukčná medicínaČlánok vyšiel v časopise
PLOS Genetics
2011 Číslo 1
- Je „freeze-all“ pro všechny? Odborníci na fertilitu diskutovali na virtuálním summitu
- Gynekologové a odborníci na reprodukční medicínu se sejdou na prvním virtuálním summitu
Najčítanejšie v tomto čísle
- H3K9me-Independent Gene Silencing in Fission Yeast Heterochromatin by Clr5 and Histone Deacetylases
- Rnf12—A Jack of All Trades in X Inactivation?
- Joint Genetic Analysis of Gene Expression Data with Inferred Cellular Phenotypes
- Evolutionary Conserved Regulation of HIF-1β by NF-κB