Phenotype Ontologies and Cross-Species Analysis for Translational Research
The use of model organisms as tools for the investigation of human genetic variation has significantly and rapidly advanced our understanding of the aetiologies underlying hereditary traits. However, while equivalences in the DNA sequence of two species may be readily inferred through evolutionary models, the identification of equivalence in the phenotypic consequences resulting from comparable genetic variation is far from straightforward, limiting the value of the modelling paradigm. In this review, we provide an overview of the emerging statistical and computational approaches to objectively identify phenotypic equivalence between human and model organisms with examples from the vertebrate models, mouse and zebrafish. Firstly, we discuss enrichment approaches, which deem the most frequent phenotype among the orthologues of a set of genes associated with a common human phenotype as the orthologous phenotype, or phenolog, in the model species. Secondly, we introduce and discuss computational reasoning approaches to identify phenotypic equivalences made possible through the development of intra- and interspecies ontologies. Finally, we consider the particular challenges involved in modelling neuropsychiatric disorders, which illustrate many of the remaining difficulties in developing comprehensive and unequivocal interspecies phenotype mappings.
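The enrichment idea described above can be illustrated with a minimal sketch. It assumes a one-sided hypergeometric test as the enrichment statistic, which is one common choice and is not claimed here to be the exact procedure of any study the review covers. All gene identifiers, phenotype labels, and the function names `hypergeom_sf` and `rank_candidate_phenologs` are hypothetical and serve only as illustration. A real analysis would also need multiple-testing correction.

```python
from math import comb


def hypergeom_sf(k, N, K, n):
    """P(X >= k) when drawing n genes from N, of which K carry the phenotype."""
    upper = min(K, n)
    total = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return total / comb(N, n)


def rank_candidate_phenologs(human_genes, orthologs, model_annotations, background):
    """Rank model-organism phenotypes by enrichment among the orthologues
    of a set of genes that share a human phenotype (smallest p-value first)."""
    # Map human genes to model-organism orthologues inside the background set.
    mapped = {orthologs[g] for g in human_genes if g in orthologs} & background
    results = []
    for phenotype, genes in model_annotations.items():
        annotated = genes & background
        overlap = len(mapped & annotated)
        if overlap == 0:
            continue  # no shared genes, so nothing to test
        p = hypergeom_sf(overlap, len(background), len(annotated), len(mapped))
        results.append((phenotype, overlap, p))
    return sorted(results, key=lambda r: r[2])


# Toy data (hypothetical): 20 model genes, 5 human genes with 1:1 orthologues.
background = {f"m{i}" for i in range(20)}
orthologs = {f"h{i}": f"m{i}" for i in range(5)}
human_genes = set(orthologs)
model_annotations = {
    "abnormal heart morphology": {"m0", "m1", "m2", "m3", "m10"},
    "decreased body size": {"m4", "m11", "m12", "m13", "m14", "m15"},
}

ranked = rank_candidate_phenologs(human_genes, orthologs, model_annotations, background)
for phenotype, overlap, p in ranked:
    print(f"{phenotype}: overlap={overlap}, p={p:.4g}")
```

In this toy case the phenotype shared by four of the five orthologues ranks first, making it the candidate phenolog. The sketch ranks by statistical over-representation against a background gene set rather than by raw frequency, so that phenotypes annotated to many genes are not favoured automatically.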
Published in:
Phenotype Ontologies and Cross-Species Analysis for Translational Research. PLoS Genet 10(4): e1004268. doi:10.1371/journal.pgen.1004268
Category:
Review
DOI:
https://doi.org/10.1371/journal.pgen.1004268
Article published in the journal:
PLOS Genetics, 2014, Issue 4