Patterns of Regulatory Variation in Diverse Human Populations
The genetic basis of gene expression variation has long been studied with the aim to understand the landscape of regulatory variants, but also more recently to assist in the interpretation and elucidation of disease signals. To date, many studies have looked in specific tissues and population-based samples, but there has been limited assessment of the degree of inter-population variability in regulatory variation. We analyzed genome-wide gene expression in lymphoblastoid cell lines from a total of 726 individuals from 8 global populations from the HapMap3 project and correlated gene expression levels with HapMap3 SNPs located in cis to the genes. We describe the influence of ancestry on gene expression levels within and between these diverse human populations and uncover a non-negligible impact on global patterns of gene expression. We further dissect the specific functional pathways differentiated between populations. We also identify 5,691 expression quantitative trait loci (eQTLs) after controlling for both non-genetic factors and population admixture and observe that half of the cis-eQTLs are replicated in one or more of the populations. We highlight patterns of eQTL-sharing between populations, which are partially determined by population genetic relatedness, and discover significant sharing of eQTL effects between Asians, European-admixed, and African subpopulations. Specifically, we observe that both the effect size and the direction of effect for eQTLs are highly conserved across populations. We observe an increasing proximity of eQTLs toward the transcription start site as sharing of eQTLs among populations increases, highlighting that variants close to TSS have stronger effects and therefore are more likely to be detected across a wider panel of populations. Together these results offer a unique picture and resource of the degree of differentiation among human populations in functional regulatory variation and provide an estimate for the transferability of complex trait variants across populations.
Vyšlo v časopise:
Patterns of Regulatory Variation in Diverse Human Populations. PLoS Genet 8(4): e32767. doi:10.1371/journal.pgen.1002639
Kategorie:
Research Article
prolekare.web.journal.doi_sk:
https://doi.org/10.1371/journal.pgen.1002639
Souhrn
The genetic basis of gene expression variation has long been studied with the aim to understand the landscape of regulatory variants, but also more recently to assist in the interpretation and elucidation of disease signals. To date, many studies have looked in specific tissues and population-based samples, but there has been limited assessment of the degree of inter-population variability in regulatory variation. We analyzed genome-wide gene expression in lymphoblastoid cell lines from a total of 726 individuals from 8 global populations from the HapMap3 project and correlated gene expression levels with HapMap3 SNPs located in cis to the genes. We describe the influence of ancestry on gene expression levels within and between these diverse human populations and uncover a non-negligible impact on global patterns of gene expression. We further dissect the specific functional pathways differentiated between populations. We also identify 5,691 expression quantitative trait loci (eQTLs) after controlling for both non-genetic factors and population admixture and observe that half of the cis-eQTLs are replicated in one or more of the populations. We highlight patterns of eQTL-sharing between populations, which are partially determined by population genetic relatedness, and discover significant sharing of eQTL effects between Asians, European-admixed, and African subpopulations. Specifically, we observe that both the effect size and the direction of effect for eQTLs are highly conserved across populations. We observe an increasing proximity of eQTLs toward the transcription start site as sharing of eQTLs among populations increases, highlighting that variants close to TSS have stronger effects and therefore are more likely to be detected across a wider panel of populations. Together these results offer a unique picture and resource of the degree of differentiation among human populations in functional regulatory variation and provide an estimate for the transferability of complex trait variants across populations.
Zdroje
1. BustamanteCDFledel-AlonAWilliamsonSNielsenRHubiszMT 2005 Natural selection on protein-coding genes in the human genome. Nature 437 1153 1157
2. BoykoARWilliamsonSHIndapARDegenhardtJDHernandezRD 2008 Assessing the evolutionary impact of amino acid mutations in the human genome. PLoS Genet 4 e1000083 doi:10.1371/journal.pgen.1000083
3. KudaravalliSVeyrierasJBStrangerBEDermitzakisETPritchardJK 2009 Gene expression levels are a target of recent natural selection in the human genome. Mol Biol Evol 26 649 658
4. TorgersonDGBoykoARHernandezRDIndapAHuX 2009 Evolutionary processes acting on candidate cis-regulatory regions in humans inferred from patterns of polymorphism and divergence. PLoS Genet 5 e1000592 doi:10.1371/journal.pgen.1000592
5. NicaACMontgomerySBDimasASStrangerBEBeazleyC 2010 Candidate causal regulatory effects by integration of expression QTLs with complex trait genetic associations. PLoS Genet 6 e1000895 doi:10.1371/journal.pgen.1000895
6. NicolaeDLGamazonEZhangWDuanSDolanME 2010 Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS. PLoS Genet 6 e1000888 doi:10.1371/journal.pgen.1000888
7. GamazonERNicolaeDLCoxNJ 2011 A study of CNVs as trait-associated polymorphisms and as expression quantitative trait loci. PLoS Genet 7 e1001292 doi:10.1371/journal.pgen.1001292
8. KingMCWilsonAC 1975 Evolution at two levels in humans and chimpanzees. Science 188 107 116
9. BlekhmanRMarioniJCZumboPStephensMGiladY 2010 Sex-specific and lineage-specific alternative splicing in primates. Genome Res 20 180 189
10. BlekhmanROshlackAGiladY 2009 Segmental duplications contribute to gene expression differences between humans and chimpanzees. Genetics 182 627 630
11. CheungVGSpielmanRSEwensKGWeberTMMorleyM 2005 Mapping determinants of human gene expression by regional and genome-wide association. Nature 437 1365 1369
12. MorleyMMolonyCMWeberTMDevlinJLEwensKG 2004 Genetic analysis of genome-wide variation in human gene expression. Nature 430 743 747
13. StrangerBEForrestMSClarkAGMinichielloMJDeutschS 2005 Genome-wide associations of gene expression variation in humans. PLoS Genet 1 e78 doi:10.1371/journal.pgen.0010078
14. StrangerBEForrestMSDunningMIngleCEBeazleyC 2007 Relative impact of nucleotide and copy number variation on gene expression phenotypes. Science 315 848 853
15. StrangerBENicaACForrestMSDimasABirdCP 2007 Population genomics of human gene expression. Nat Genet 39 1217 1224
16. EmilssonVThorleifssonGZhangBLeonardsonASZinkF 2008 Genetics of gene expression and its effect on disease. Nature 452 423 428
17. GrundbergEKwanTGeBLamKCKokaV 2009 Population genomics in a disease targeted primary cell model. Genome Res 19 1942 1952
18. MyersAJGibbsJRWebsterJARohrerKZhaoA 2007 A survey of genetic human cortical gene expression. Nat Genet 39 1494 1499
19. DimasASDeutschSStrangerBEMontgomerySBBorelC 2009 Common regulatory variation impacts gene expression in a cell type-dependent manner. Science 325 1246 1250
20. KwanTGrundbergEKokaVGeBLamKC 2009 Tissue effect on genetic control of transcript isoform variation. PLoS Genet 5 e1000608 doi:10.1371/journal.pgen.1000608
21. SpielmanRSBastoneLABurdickJTMorleyMEwensWJ 2007 Common genetic variants account for differences in gene expression among ethnic groups. Nat Genet 39 226 231
22. KuhnKBakerSCChudinELieuMHOeserS 2004 A novel, high-performance random array platform for quantitative gene expression profiling. Genome Res 14 2347 2356
23. BolstadBMIrizarryRAAstrandMSpeedTP 2003 A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 19 185 193
24. PriceALPattersonNJPlengeRMWeinblattMEShadickNA 2006 Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38 904 909
25. PembertonTJWangCLiJZRosenbergNA 2010 Inference of unexpected genetic relatedness among individuals in HapMap Phase III. Am J Hum Genet 87 457 464
26. StegleOKannanADurbinRWinnJ 2008 Accounting for Non-genetic Factors Improves the Power of eQTL Studies. VingronMWongL 411 422 Research in Computational Molecular Biology: Springer Berlin / Heidelberg
27. StegleOPartsLDurbinRWinnJ 2010 A Bayesian framework to account for complex non-genetic factors in gene expression levels greatly increases power in eQTL studies. PLoS Comput Biol 6 e1000770 doi:10.1371/journal.pcbi.1000770
28. RedonRIshikawaSFitchKRFeukLPerryGH 2006 Global variation in copy number in the human genome. Nature 444 444 454
29. MontgomerySBSammethMGutierrez-ArcelusMLachRPIngleC 2010 Transcriptome genetics using second generation sequencing in a Caucasian population. Nature 464 773 777
30. ChurchillGADoergeRW 1994 Empirical threshold values for quantitative trait mapping. Genetics 138 963 971
31. SpitzFGonzalezFPeichelCVogtTFDubouleD 2001 Large scale transgenic and cluster deletion analysis of the HoxD complex separate an ancestral regulatory module from evolutionary innovations. Genes Dev 15 2209 2214
32. MackayTFStoneEAAyrolesJF 2009 The genetics of quantitative traits: challenges and prospects. Nat Rev Genet 10 565 577
33. StrangerBEDermitzakisET 2006 From DNA to RNA to disease and back: the ‘central dogma’ of regulatory disease variation. Hum Genomics 2 383 390
34. MusunuruKStrongAFrank-KamenetskyMLeeNEAhfeldtT 2010 From noncoding variant to phenotype via SORT1 at the 1p13 cholesterol locus. Nature 466 714 719
35. HindorffLASethupathyPJunkinsHARamosEMMehtaJP 2009 Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A 106 9362 9367
36. HindorffLAJunkinsHAHallPNMehtaJPManolioTA A Catalog of Published Genome-Wide Association Studies
37. YangTPBeazleyCMontgomerySBDimasASGutierrez-ArcelusM 2010 Genevar: a database and Java application for the analysis and visualization of SNP-gene associations in eQTL studies. Bioinformatics 26 2474 2476
Štítky
Genetika Reprodukčná medicínaČlánok vyšiel v časopise
PLOS Genetics
2012 Číslo 4
- Gynekologové a odborníci na reprodukční medicínu se sejdou na prvním virtuálním summitu
- Je „freeze-all“ pro všechny? Odborníci na fertilitu diskutovali na virtuálním summitu
Najčítanejšie v tomto čísle
- A Coordinated Interdependent Protein Circuitry Stabilizes the Kinetochore Ensemble to Protect CENP-A in the Human Pathogenic Yeast
- Coordinate Regulation of Lipid Metabolism by Novel Nuclear Receptor Partnerships
- Defective Membrane Remodeling in Neuromuscular Diseases: Insights from Animal Models
- Formation of Rigid, Non-Flight Forewings (Elytra) of a Beetle Requires Two Major Cuticular Proteins