
Proteins Encoded in Genomic Regions Associated with Immune-Mediated Disease Physically Interact and Suggest Underlying Biology


Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes perturbed by these risk variants. It has previously been observed that different genes harboring causal mutations for the same Mendelian disease often physically interact. We sought to evaluate the degree to which this is true of genes within strongly associated loci in complex disease. Using sets of loci defined in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein–protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between the protein products of associated genes. We apply multiple permutation approaches to show that these networks are more densely connected than expected by chance. To confirm biological relevance, we show that the components of the networks tend to be expressed in similar tissues relevant to the phenotypes in question, suggesting that the networks indicate common underlying processes perturbed by risk loci. Furthermore, we show that the RA and CD networks have predictive power: proteins in these networks that are not encoded in the confirmed list of disease-associated loci are significantly enriched for association with the phenotypes in question in extended GWAS analysis. Finally, we test our method in three non-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated with height and lipid levels assemble into significantly connected networks, but we do not detect connectivity beyond chance among Type 2 Diabetes (T2D) loci.
Taken together, our results constitute evidence that, for many of the complex diseases studied here, common genetic associations implicate regions encoding proteins that physically interact in a preferential manner, in line with observations in Mendelian disease.
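To make the idea of testing connectivity against chance concrete, the following is a minimal sketch of one simple permutation test, not the authors' actual procedure. It assumes a toy PPI network with hypothetical gene names (`G0` to `G29`), counts the direct interactions among a set of associated genes, and compares that count with the counts obtained for randomly drawn gene sets of the same size. A realistic analysis would also need to account for protein degree and locus structure, which this sketch ignores.

```python
import random
from itertools import combinations


def count_direct_edges(genes, edges):
    """Count PPI edges whose two endpoints are both in `genes`."""
    gene_set = set(genes)
    return sum(1 for a, b in edges if a in gene_set and b in gene_set)


def permutation_p_value(associated, all_genes, edges, n_perm=1000, seed=0):
    """Empirical p-value for the number of direct edges among `associated`.

    Null model (an assumption of this sketch): gene sets of the same size
    drawn uniformly at random from `all_genes`.
    """
    rng = random.Random(seed)
    observed = count_direct_edges(associated, edges)
    k = len(set(associated))
    universe = sorted(set(all_genes))
    hits = 0
    for _ in range(n_perm):
        sample = rng.sample(universe, k)
        if count_direct_edges(sample, edges) >= observed:
            hits += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return observed, (hits + 1) / (n_perm + 1)


# Hypothetical toy data: 30 genes, a fully connected cluster among the
# first five, and a sparse chain of interactions among the rest.
all_genes = [f"G{i}" for i in range(30)]
associated = all_genes[:5]
edges = list(combinations(associated, 2))
edges += [(f"G{i}", f"G{i + 1}") for i in range(5, 29)]

observed, p_value = permutation_p_value(associated, all_genes, edges)
print(f"direct edges among associated genes: {observed}")
print(f"empirical p-value: {p_value:.4f}")
```

In this toy setting the associated genes form a dense cluster, so random gene sets rarely match their edge count and the empirical p-value is small. Drawing gene sets uniformly is the simplest possible null model and is likely to be anti-conservative on real data, where well-studied proteins have many more reported interactions.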


Published in: Proteins Encoded in Genomic Regions Associated with Immune-Mediated Disease Physically Interact and Suggest Underlying Biology. PLoS Genet 7(1): e1001273. doi:10.1371/journal.pgen.1001273
Category: Research Article
DOI: https://doi.org/10.1371/journal.pgen.1001273




Tags
Genetics, Reproductive Medicine

This article was published in

PLOS Genetics


2011, Issue 1