Incorporating Motif Analysis into Gene Co-expression Networks Reveals Novel Modular Expression Pattern and New Signaling Pathways
Understanding of gene regulatory networks requires discovery of expression modules within gene co-expression networks and identification of promoter motifs and corresponding transcription factors that regulate their expression. A commonly used method for this purpose is a top-down approach based on clustering the network into a range of densely connected segments, treating these segments as expression modules, and extracting promoter motifs from these modules. Here, we describe a novel bottom-up approach to identify gene expression modules driven by known cis-regulatory motifs in the gene promoters. For a specific motif, genes in the co-expression network are ranked according to their probability of belonging to an expression module regulated by that motif. The ranking is conducted via motif enrichment or motif position bias analysis. Our results indicate that motif position bias analysis is an effective tool for genome-wide motif analysis. Sub-networks containing the top ranked genes are extracted and analyzed for inherent gene expression modules. This approach identified novel expression modules for the G-box, W-box, site II, and MYB motifs from an Arabidopsis thaliana gene co-expression network based on the graphical Gaussian model. The novel expression modules include those involved in house-keeping functions, primary and secondary metabolism, and abiotic and biotic stress responses. In addition to confirmation of previously described modules, we identified modules that include new signaling pathways. To associate transcription factors that regulate genes in these co-expression modules, we developed a novel reporter system. Using this approach, we evaluated MYB transcription factor-promoter interactions within MYB motif modules.
Vyšlo v časopise:
Incorporating Motif Analysis into Gene Co-expression Networks Reveals Novel Modular Expression Pattern and New Signaling Pathways. PLoS Genet 9(10): e32767. doi:10.1371/journal.pgen.1003840
Kategorie:
Research Article
prolekare.web.journal.doi_sk:
https://doi.org/10.1371/journal.pgen.1003840
Souhrn
Understanding of gene regulatory networks requires discovery of expression modules within gene co-expression networks and identification of promoter motifs and corresponding transcription factors that regulate their expression. A commonly used method for this purpose is a top-down approach based on clustering the network into a range of densely connected segments, treating these segments as expression modules, and extracting promoter motifs from these modules. Here, we describe a novel bottom-up approach to identify gene expression modules driven by known cis-regulatory motifs in the gene promoters. For a specific motif, genes in the co-expression network are ranked according to their probability of belonging to an expression module regulated by that motif. The ranking is conducted via motif enrichment or motif position bias analysis. Our results indicate that motif position bias analysis is an effective tool for genome-wide motif analysis. Sub-networks containing the top ranked genes are extracted and analyzed for inherent gene expression modules. This approach identified novel expression modules for the G-box, W-box, site II, and MYB motifs from an Arabidopsis thaliana gene co-expression network based on the graphical Gaussian model. The novel expression modules include those involved in house-keeping functions, primary and secondary metabolism, and abiotic and biotic stress responses. In addition to confirmation of previously described modules, we identified modules that include new signaling pathways. To associate transcription factors that regulate genes in these co-expression modules, we developed a novel reporter system. Using this approach, we evaluated MYB transcription factor-promoter interactions within MYB motif modules.
Zdroje
1. BraunP, CarvunisAR, CharloteauxB, DrezeM, EckerJR, et al. (2011) Evidence for Network Evolution in an Arabidopsis Interactome Map. Science 333: 601–607.
2. FeistAM, HerrgardMJ, ThieleI, ReedJL, PalssonBO (2009) Reconstruction of biochemical networks in microorganisms. Nat Rev Microbiol 7: 129–143.
3. BarabasiAL, OltvaiZN (2004) Network biology: understanding the cell's functional organization. Nat Rev Genet 5: 101–113.
4. ChenJ, LalondeS, ObrdlikP, Noorani VataniA, ParsaSA, et al. (2012) Uncovering Arabidopsis membrane protein interactome enriched in transporters using mating-based split ubiquitin assays and classification models. Front Plant Sci 3: 124.
5. PopescuSC, PopescuGV, BachanS, ZhangZ, SeayM, et al. (2007) Differential binding of calmodulin-related proteins to their targets revealed through high-density Arabidopsis protein microarrays. Proc Natl Acad Sci U S A 104: 4730–4735.
6. BradySM, ZhangLF, MegrawM, MartinezNJ, JiangE, et al. (2011) A stele-enriched gene regulatory network in the Arabidopsis root. Molecular Systems Biology 7: 459.
7. GaudinierA, ZhangLF, Reece-HoyesJS, Taylor-TeeplesM, PuL, et al. (2011) Enhanced Y1H assays for Arabidopsis. Nature Methods 8: 1053–5.
8. PopescuSC, PopescuGV, BachanS, ZhangZ, GersteinM, et al. (2009) MAPK target networks in Arabidopsis thaliana revealed using functional protein microarrays. Genes Dev 23: 80–92.
9. MaoLY, Van HemertJL, DashS, DickersonJA (2009) Arabidopsis gene co-expression network and its functional modules. Bmc Bioinformatics 10: 346.
10. MentzenWI, WurteleES (2008) Regulon organization of Arabidopsis. BMC Plant Biol 8: 99.
11. ChildsKL, DavidsonRM, BuellCR (2011) Gene Coexpression Network Analysis as a Source of Functional Annotation for Rice Genes. PLoS One 6: e22196.
12. FukushimaA, NishizawaT, HayakumoM, HikosakaS, SaitoK, et al. (2012) Exploring Tomato Gene Functions Based on Coexpression Modules Using Graph Clustering and Differential Coexpression Approaches. Plant Physiology 158: 1487–1502.
13. ObayashiT, KinoshitaK (2010) Coexpression landscape in ATTED-II: usage of gene list and gene network for various types of pathways. Journal of Plant Research 123: 311–319.
14. UsadelB, ObayashiT, MutwilM, GiorgiFM, BasselGW, et al. (2009) Co-expression tools for plant biology: opportunities for hypothesis generation and caveats. Plant Cell and Environment 32: 1633–1651.
15. MaS, GongQ, BohnertHJ (2007) An Arabidopsis gene network based on the graphical Gaussian model. Genome Res 17: 1614–1625.
16. SchäferJ, StrimmerK (2005) A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics. Stat Appl Genet Mol Biol 4: Article32.
17. WilleA, ZimmermannP, VranovaE, FurholzA, LauleO, et al. (2004) Sparse graphical Gaussian modeling of the isoprenoid gene network in Arabidopsis thaliana. Genome Biol 5: R92.
18. HeyndrickxKS, VandepoeleK (2012) Systematic Identification of Functional Plant Modules through the Integration of Complementary Data Sources. Plant Physiology 159: 884–901.
19. De BodtS, HollunderJ, NelissenH, MeulemeesterN, InzeD (2012) CORNET 2.0: integrating plant coexpression, protein-protein interactions, regulatory interactions, gene associations and functional annotations. New Phytologist 195: 707–720.
20. LeeI, SeoY-S, ColtraneD, HwangS, OhT, et al. (2011) Genetic dissection of the biotic stress response using a genome-scale gene network for rice. Proceedings of the National Academy of Sciences of the United States of America 108: 18548–18553.
21. LeeI, AmbaruB, ThakkarP, MarcotteEM, RheeSY (2010) Rational association of genes with traits using a genome-scale gene network for Arabidopsis thaliana. Nature Biotechnology 28: 149–U114.
22. LysenkoA, Defoin-PlatelM, Hassani-PakK, TaubertJ, HodgmanC, et al. (2011) Assessing the functional coherence of modules found in multiple-evidence networks from Arabidopsis. BMC Bioinformatics 12.
23. EnrightAJ, Van DongenS, OuzounisCA (2002) An efficient algorithm for large-scale detection of protein families. Nucleic Acids Research 30: 1575–1584.
24. MochidaK, Uehara-YamaguchiY, YoshidaT, SakuraiT, ShinozakiK (2011) Global landscape of a co-expressed gene network in barley and its application to gene discovery in Triticeae crops. Plant Cell Physiol 52: 785–803.
25. RuanJ, PerezJ, HernandezB, LeiC, SunterG, et al. (2011) Systematic identification of functional modules and cis-regulatory elements in Arabidopsis thaliana. BMC Bioinformatics 12 Suppl 12: S2.
26. MaS, BohnertHJ (2008) Gene networks in Arabidopsis thaliana for metabolic and environmental functions. Mol Biosyst 4: 199–204.
27. MaS, BachanS, PortoM, BohnertHJ, SnyderM, et al. (2012) Discovery of stress responsive DNA regulatory motifs in Arabidopsis. PLoS One 7: e43198.
28. JakobyM, WeisshaarB, Droge-LaserW, Vicente-CarbajosaJ, TiedemannJ, et al. (2002) bZIP transcription factors in Arabidopsis. Trends Plant Sci 7: 106–111.
29. ChoiH, HongJ, HaJ, KangJ, KimSY (2000) ABFs, a family of ABA-responsive element binding factors. J Biol Chem 275: 1723–1730.
30. UnoY, FurihataT, AbeH, YoshidaR, ShinozakiK, et al. (2000) Arabidopsis basic leucine zipper transcription factors involved in an abscisic acid-dependent signal transduction pathway under drought and high-salinity conditions. Proc Natl Acad Sci U S A 97: 11632–11637.
31. AlonsoR, Onate-SanchezL, WeltmeierF, EhlertA, DiazI, et al. (2009) A Pivotal Role of the Basic Leucine Zipper Transcription Factor bZIP53 in the Regulation of Arabidopsis Seed Maturation Gene Expression Based on Heterodimerization and Protein Complex Formation. Plant Cell 21: 1747–1761.
32. BensmihenS, GiraudatJ, ParcyF (2005) Characterization of three homologous basic leucine zipper transcription factors (bZIP) of the ABI5 family during Arabidopsis thaliana embryo maturation. Journal of Experimental Botany 56: 597–603.
33. ChenH, ZhangJ, NeffMM, HongS-W, ZhangH, et al. (2008) Integration of light and abscisic acid signaling during seed germination and early seedling development. Proceedings of the National Academy of Sciences of the United States of America 105: 4495–4500.
34. IwataY, KoizumiN (2005) An Arabidopsis transcription factor, AtbZIP60, regulates the endoplasmic reticulum stress response in a manner unique to plants. Proc Natl Acad Sci U S A 102: 5280–5285.
35. LiuJX, SrivastavaR, CheP, HowellSH (2007) An endoplasmic reticulum stress response in Arabidopsis is mediated by proteolytic processing and nuclear relocation of a membrane-associated transcription factor, bZIP28. Plant Cell 19: 4111–4119.
36. TajimaH, IwataY, IwanoM, TakayamaS, KoizumiN (2008) Identification of an Arabidopsis transmembrane bZIP transcription factor involved in the endoplasmic reticulum stress response. Biochem Biophys Res Commun 374: 242–247.
37. KimTH, HauserF, HaT, XueS, BohmerM, et al. (2011) Chemical genetics reveals negative regulation of abscisic acid signaling by a plant immune response pathway. Curr Biol 21: 990–997.
38. MaS, BohnertHJ (2007) Integration of Arabidopsis thaliana stress-related transcript profiles, promoter structures, and cell-specific expression. Genome Biol 8: R49.
39. Toledo-OrtizG, HuqE, QuailPH (2003) The Arabidopsis basic/helix-loop-helix transcription factor family. Plant Cell 15: 1749–1770.
40. HuqE, QuailPH (2002) PIF4, a phytochrome-interacting bHLH factor, functions as a negative regulator of phytochrome B signaling in Arabidopsis. EMBO J 21: 2441–2450.
41. Martinez-GarciaJF, HuqE, QuailPH (2000) Direct targeting of light signals to a promoter element-bound transcription factor. Science 288: 859–863.
42. AbeH, Yamaguchi-ShinozakiK, UraoT, IwasakiT, HosokawaD, et al. (1997) Role of arabidopsis MYC and MYB homologs in drought- and abscisic acid-regulated gene expression. Plant Cell 9: 1859–1868.
43. DombrechtB, XueGP, SpragueSJ, KirkegaardJA, RossJJ, et al. (2007) MYC2 differentially modulates diverse jasmonate-dependent functions in Arabidopsis. Plant Cell 19: 2225–2245.
44. YadavV, MallappaC, GangappaSN, BhatiaS, ChattopadhyayS (2005) A basic helix-loop-helix transcription factor in Arabidopsis, MYC2, acts as a repressor of blue light-mediated photomorphogenic growth. Plant Cell 17: 1953–1966.
45. ZimmermannP, Hirsch-HoffmannM, HennigL, GruissemW (2004) GENEVESTIGATOR. Arabidopsis microarray database and analysis toolbox. Plant Physiol 136: 2621–2632.
46. AndersonJP, BadruzsaufariE, SchenkPM, MannersJM, DesmondOJ, et al. (2004) Antagonistic interaction between abscisic acid and jasmonate-ethylene signaling pathways modulates defense gene expression and disease resistance in Arabidopsis. Plant Cell 16: 3460–3479.
47. NaharK, KyndtT, NzogelaYB, GheysenG (2012) Abscisic acid interacts antagonistically with classical defense pathways in rice-migratory nematode interaction. New Phytol 196: 901–913.
48. DubosC, StrackeR, GrotewoldE, WeisshaarB, MartinC, et al. (2010) MYB transcription factors in Arabidopsis. Trends Plant Sci 15: 573–581.
49. FellerA, MachemerK, BraunEL, GrotewoldE (2011) Evolutionary and comparative analysis of MYB and bHLH plant transcription factors. Plant Journal 66: 94–116.
50. GrotewoldE, DrummondBJ, BowenB, PetersonT (1994) The myb-homologous P gene controls phlobaphene pigmentation in maize floral organs by directly activating a flavonoid biosynthetic gene subset. Cell 76: 543–553.
51. SablowskiRW, MoyanoE, Culianez-MaciaFA, SchuchW, MartinC, et al. (1994) A flower-specific Myb protein activates transcription of phenylpropanoid biosynthetic genes. EMBO J 13: 128–137.
52. BorevitzJO, XiaY, BlountJ, DixonRA, LambC (2000) Activation tagging identifies a conserved MYB regulator of phenylpropanoid biosynthesis. Plant Cell 12: 2383–2394.
53. GigolashviliT, BergerB, MockHP, MullerC, WeisshaarB, et al. (2007) The transcription factor HIG1/MYB51 regulates indolic glucosinolate biosynthesis in Arabidopsis thaliana. Plant J 50: 886–901.
54. GigolashviliT, YatusevichR, BergerB, MullerC, FluggeUI (2007) The R2R3-MYB transcription factor HAG1/MYB28 is a regulator of methionine-derived glucosinolate biosynthesis in Arabidopsis thaliana. Plant J 51: 247–261.
55. HiraiMY, SugiyamaK, SawadaY, TohgeT, ObayashiT, et al. (2007) Omics-based identification of Arabidopsis Myb transcription factors regulating aliphatic glucosinolate biosynthesis. Proc Natl Acad Sci U S A 104: 6478–6483.
56. ImamuraS, KanesakiY, OhnumaM, InouyeT, SekineY, et al. (2009) R2R3-type MYB transcription factor, CmMYB1, is a central nitrogen assimilation regulator in Cyanidioschyzon merolae. Proc Natl Acad Sci U S A 106: 12548–12553.
57. KoesR, VerweijW, QuattrocchioF (2005) Flavonoids: a colorful model for the regulation and evolution of biochemical pathways. Trends Plant Sci 10: 236–242.
58. WangR, GuanP, ChenM, XingX, ZhangY, et al. (2010) Multiple regulatory elements in the Arabidopsis NIA1 promoter act synergistically to form a nitrate enhancer. Plant Physiol 154: 423–432.
59. ZhongR, LeeC, ZhouJ, McCarthyRL, YeZH (2008) A battery of transcription factors involved in the regulation of secondary cell wall biosynthesis in Arabidopsis. Plant Cell 20: 2763–2782.
60. ZhouJ, LeeC, ZhongR, YeZH (2009) MYB58 and MYB63 are transcriptional activators of the lignin biosynthetic pathway during secondary cell wall formation in Arabidopsis. Plant Cell 21: 248–266.
61. EulgemT, RushtonPJ, RobatzekS, SomssichIE (2000) The WRKY superfamily of plant transcription factors. Trends Plant Sci 5: 199–206.
62. CiolkowskiI, WankeD, BirkenbihlRP, SomssichIE (2008) Studies on DNA-binding selectivity of WRKY transcription factors lend structural clues into WRKY-domain function. Plant Mol Biol 68: 81–92.
63. DevaiahBN, KarthikeyanAS, RaghothamaKG (2007) WRKY75 transcription factor is a modulator of phosphate acquisition and root development in Arabidopsis. Plant Physiol 143: 1789–1801.
64. KosugiS, OhashiY (1997) PCF1 and PCF2 specifically bind to cis elements in the rice proliferating cell nuclear antigen gene. Plant Cell 9: 1607–1619.
65. KosugiS, SuzukaI, OhashiY (1995) Two of three promoter elements identified in a rice gene for proliferating cell nuclear antigen are essential for meristematic tissue-specific expression. Plant Journal 7: 877–886.
66. TremousaygueD, GarnierL, BardetC, DabosP, HerveC, et al. (2003) Internal telomeric repeats and ‘TCP domain’ protein-binding sites co-operate to regulate gene expression in Arabidopsis thaliana cycling cells. Plant Journal 33: 957–966.
67. WelchenE, GonzalezDH (2005) Differential expression of the Arabidopsis cytochrome c genes Cytc-1 and Cytc-2. Evidence for the involvement of TCP-domain protein-binding elements in anther- and meristem-specific expression of the Cytc-1 gene. Plant Physiol 139: 88–100.
68. HortensteinerS (2009) Stay-green regulates chlorophyll and chlorophyll-binding protein degradation during senescence. Trends Plant Sci 14: 155–162.
69. StarkA, LinMF, KheradpourP, PedersenJS, PartsL, et al. (2007) Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures. Nature 450: 219–232.
70. WalleyJW, CoughlanS, HudsonME, CovingtonMF, KaspiR, et al. (2007) Mechanical stress induces biotic and abiotic stress responses via a novel cis-element. PLoS Genet 3: 1800–1812.
71. HellensRP, AllanAC, FrielEN, BolithoK, GraftonK, et al. (2005) Transient expression vectors for functional genomics, quantification of promoter activity and RNA silencing in plants. Plant Methods 1: 13.
72. BhuiyanNH, SelvarajG, WeiY, KingJ (2009) Gene expression profiling and silencing reveal that monolignol biosynthesis plays a critical role in penetration defence in wheat against powdery mildew invasion. J Exp Bot 60: 509–521.
73. TrumanW, de ZabalaMT, GrantM (2006) Type III effectors orchestrate a complex interplay between transcriptional networks to modify basal defence responses during pathogenesis and resistance. Plant J 46: 14–33.
74. BednarekP, SchneiderB, SvatosA, OldhamNJ, HahlbrockK (2005) Structural complexity, differential response to infection, and tissue specificity of indolic and phenylpropanoid secondary metabolism in Arabidopsis roots. Plant Physiol 138: 1058–1070.
75. ZhaoQ, DixonRA (2011) Transcriptional networks for lignin biosynthesis: more complex than we thought? Trends Plant Sci 16: 227–233.
76. VandepoeleK, QuimbayaM, CasneufT, De VeylderL, Van de PeerY (2009) Unraveling transcriptional control in Arabidopsis using cis-regulatory elements and coexpression networks. Plant Physiol 150: 535–546.
77. YokoyamaKD, OhlerU, WrayGA (2009) Measuring spatial preferences at fine-scale resolution identifies known and novel cis-regulatory element candidates and functional motif-pair relationships. Nucleic Acids Research 37: e92.
78. VardhanabhutiS, WangJW, HannenhalliS (2007) Position and distance specificity are important determinants of cis-regulatory motifs in addition to evolutionary conservation. Nucleic Acids Research 35: 3203–3213.
79. ElementoO, SlonimN, TavazoieS (2007) A universal framework for regulatory element discovery across all genomes and data types. Mol Cell 28: 337–350.
80. LinhartC, HalperinY, ShamirR (2008) Transcription factor and microRNA motif discovery: the Amadeus platform and a compendium of metazoan target sets. Genome Res 18: 1180–1189.
81. SchäferJ, Opgen-RheinR, StrimmerK (2006) Reverse Engineering Genetic Networks using the GeneNet Package. R News 6/5: 50–53.
82. PalaniswamySK, JamesS, SunH, LambRS, DavuluriRV, et al. (2006) AGRIS and AtRegNet. a platform to link cis-regulatory elements and transcription factors into regulatory networks. Plant Physiol 140: 818–829.
83. HigoK, UgawaY, IwamotoM, HigoH (1998) PLACE: a database of plant cis-acting regulatory DNA elements. Nucleic Acids Res 26: 358–359.
84. GansnerER, KorenY, NorthS (2004) Graph drawing by stress majorization. Graph Drawing 3383: 239–250.
85. GansnerER, NorthSC (2000) An open graph visualization system and its applications to software engineering. Software-Practice & Experience 30: 1203–1233.
86. SchmidM, DavisonTS, HenzSR, PapeUJ, DemarM, et al. (2005) A gene expression map of Arabidopsis thaliana development. Nat Genet 37: 501–506.
87. KilianJ, WhiteheadD, HorakJ, WankeD, WeinlS, et al. (2007) The AtGenExpress global stress expression data set: protocols, evaluation and model data analysis of UV-B light, drought and cold stress responses. Plant J 50: 347–363.
88. KarimiM, De MeyerB, HilsonP (2005) Modular cloning in plant cells. Trends Plant Sci 10: 103–105.
Štítky
Genetika Reprodukčná medicínaČlánok vyšiel v časopise
PLOS Genetics
2013 Číslo 10
- Je „freeze-all“ pro všechny? Odborníci na fertilitu diskutovali na virtuálním summitu
- Gynekologové a odborníci na reprodukční medicínu se sejdou na prvním virtuálním summitu
Najčítanejšie v tomto čísle
- Dominant Mutations in Identify the Mlh1-Pms1 Endonuclease Active Site and an Exonuclease 1-Independent Mismatch Repair Pathway
- Eleven Candidate Susceptibility Genes for Common Familial Colorectal Cancer
- The Histone H3 K27 Methyltransferase KMT6 Regulates Development and Expression of Secondary Metabolite Gene Clusters
- A Mutation in the Gene in Labrador Retrievers with Hereditary Nasal Parakeratosis (HNPK) Provides Insights into the Epigenetics of Keratinocyte Differentiation