#PAGE_PARAMS# #ADS_HEAD_SCRIPTS# #MICRODATA#

Predicting Mendelian Disease-Causing Non-Synonymous Single Nucleotide Variants in Exome Sequencing Studies


Exome sequencing is becoming a standard tool for mapping Mendelian disease-causing (or pathogenic) non-synonymous single nucleotide variants (nsSNVs). Minor allele frequency (MAF) filtering approach and functional prediction methods are commonly used to identify candidate pathogenic mutations in these studies. Combining multiple functional prediction methods may increase accuracy in prediction. Here, we propose to use a logit model to combine multiple prediction methods and compute an unbiased probability of a rare variant being pathogenic. Also, for the first time we assess the predictive power of seven prediction methods (including SIFT, PolyPhen2, CONDEL, and logit) in predicting pathogenic nsSNVs from other rare variants, which reflects the situation after MAF filtering is done in exome-sequencing studies. We found that a logit model combining all or some original prediction methods outperforms other methods examined, but is unable to discriminate between autosomal dominant and autosomal recessive disease mutations. Finally, based on the predictions of the logit model, we estimate that an individual has around 5% of rare nsSNVs that are pathogenic and carries ∼22 pathogenic derived alleles at least, which if made homozygous by consanguineous marriages may lead to recessive diseases.


Vyšlo v časopise: Predicting Mendelian Disease-Causing Non-Synonymous Single Nucleotide Variants in Exome Sequencing Studies. PLoS Genet 9(1): e32767. doi:10.1371/journal.pgen.1003143
Kategorie: Research Article
prolekare.web.journal.doi_sk: https://doi.org/10.1371/journal.pgen.1003143

Souhrn

Exome sequencing is becoming a standard tool for mapping Mendelian disease-causing (or pathogenic) non-synonymous single nucleotide variants (nsSNVs). Minor allele frequency (MAF) filtering approach and functional prediction methods are commonly used to identify candidate pathogenic mutations in these studies. Combining multiple functional prediction methods may increase accuracy in prediction. Here, we propose to use a logit model to combine multiple prediction methods and compute an unbiased probability of a rare variant being pathogenic. Also, for the first time we assess the predictive power of seven prediction methods (including SIFT, PolyPhen2, CONDEL, and logit) in predicting pathogenic nsSNVs from other rare variants, which reflects the situation after MAF filtering is done in exome-sequencing studies. We found that a logit model combining all or some original prediction methods outperforms other methods examined, but is unable to discriminate between autosomal dominant and autosomal recessive disease mutations. Finally, based on the predictions of the logit model, we estimate that an individual has around 5% of rare nsSNVs that are pathogenic and carries ∼22 pathogenic derived alleles at least, which if made homozygous by consanguineous marriages may lead to recessive diseases.


Zdroje

1. NgSB, TurnerEH, RobertsonPD, FlygareSD, BighamAW, et al. (2009) Targeted capture and massively parallel sequencing of 12 human exomes. Nature 461: 272–U153.

2. StensonPD, MortM, BallEV, HowellsK, PhillipsAD, et al. (2009) The Human Gene Mutation Database: 2008 update. Genome Med 1: 13.

3. LiMX, GuiHS, KwanJS, BaoSY, ShamPC (2012) A comprehensive framework for prioritizing variants in exome sequencing studies of Mendelian diseases. Nucleic Acids Res 40: e53.

4. GeD, RuzzoEK, ShiannaKV, HeM, PelakK, et al. (2011) SVA: software for annotating and visualizing sequenced human genomes. Bioinformatics 27: 1998–2000.

5. WangK, LiM, HakonarsonH (2010) ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res 38: e164.

6. NgPC, HenikoffS (2006) Predicting the effects of amino acid substitutions on protein function. Annu Rev Genomics Hum Genet 7: 61–80.

7. Gonzalez-PerezA, Lopez-BigasN (2011) Improving the assessment of the outcome of nonsynonymous SNVs with a consensus deleteriousness score, Condel. Am J Hum Genet 88: 440–449.

8. LopesMC, JoyceC, RitchieGR, JohnSL, CunninghamF, et al. (2012) A combined functional annotation score for non-synonymous variants. Hum Hered 73: 47–51.

9. AdzhubeiIA, SchmidtS, PeshkinL, RamenskyVE, GerasimovaA, et al. (2010) A method and server for predicting damaging missense mutations. Nat Methods 7: 248–249.

10. KryukovGV, PennacchioLA, SunyaevSR (2007) Most rare missense alleles are deleterious in humans: implications for complex disease and association studies. American journal of human genetics 80: 727–739.

11. SimNL, KumarP, HuJ, HenikoffS, SchneiderG, et al. (2012) SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Res

12. ChunS, FayJC (2009) Identification of deleterious mutations within three human genomes. Genome Res 19: 1553–1561.

13. SchwarzJM, RodelspergerC, SchuelkeM, SeelowD (2010) MutationTaster evaluates disease-causing potential of sequence alterations. Nat Methods 7: 575–576.

14. CooperGM, StoneEA, AsimenosG, ProgramNCS, GreenED, et al. (2005) Distribution and intensity of constraint in mammalian genomic sequence. Genome Res 15: 901–913.

15. MatthewsBW (1975) Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim Biophys Acta 405: 442–451.

16. KwanS, PurcellS, ShamP (2007) Introduction to biometrical genetics. Statistical

17. ScheidelW (1997) Brother-sister marriage in Roman Egypt. J Biosoc Sci 29: 361–371.

18. LeuteneggerAL, PrumB, GeninE, VernyC, LemainqueA, et al. (2003) Estimation of the inbreeding coefficient through use of genomic data. Am J Hum Genet 73: 516–523.

19. LiM, PangS, SongY, KungM, HoSL, et al. (2012) Whole exome sequencing identifies a novel mutation in the transglutaminase 6 gene for spinocerebellar ataxia in a Chinese family. Clin Genet

20. MaoH, YangW, LeePP, HoMH, YangJ, et al. (2012) Exome sequencing identifies novel compound heterozygous mutations of IL-10 receptor 1 in neonatal-onset Crohn's disease. Genes Immun

21. FurneySJ, AlbaMM, Lopez-BigasN (2006) Differences in the evolutionary history of disease genes affected by dominant or recessive mutations. BMC Genomics 7: 165.

22. Jimenez-SanchezG, ChildsB, ValleD (2001) Human disease genes. Nature 409: 853–855.

23. Lopez-BigasN, BlencoweBJ, OuzounisCA (2006) Highly consistent patterns for inherited human diseases at the molecular level. Bioinformatics 22: 269–277.

24. PappB, PalC, HurstLD (2003) Dosage sensitivity and the evolution of gene families in yeast. Nature 424: 194–197.

25. Strachan T, Read AP (1999) Human Molecular Genetics; edition. n, editor. New York: Wiley-Liss.

26. CrowJF, KimuraM (1970) An introduction to population genetics theory. An introduction to population genetics theory

27. SutterJ, TabahL (1952) Effets de la consanguinité et de l'endogamie. Une enquête en Morbihan et Loir-et-Cher. Population (French Edition) 249–266.

28. SlatisHM (1954) A method of estimating the frequency of abnormal autosomal recessive genes in man. American journal of human genetics 6: 412.

29. SlatisHM, ReisRH, HoeneRE (1958) Consanguineous marriages in the Chicago region. American journal of human genetics 10: 446.

30. Scott-EmuakporAB (1974) The mutation load in an African population. I. An analysis of consanguineous marriages in Nigeria. American journal of human genetics 26: 674.

31. MortonNE, CrowJF, MullerH (1956) An estimate of the mutational damage in man from data on consanguineous marriages. Proceedings of the National Academy of Sciences of the United States of America 42: 855.

32. BittlesAH, NeelJV (1994) The costs of human inbreeding and their implications for variations at the DNA level. Nature genetics 8: 117–121.

33. KondrashovAS (1995) Contamination of the genome by very slightly deleterious mutations: why have we not died 100 times over? Journal of theoretical biology 175: 583–594.

34. RiazuddinS, CasteleinCM, AhmedZM, LalwaniAK, MastroianniMA, et al. (2000) Dominant modifier DFNM1 suppresses recessive deafness DFNB26. Nature genetics 26: 431–434.

35. OpreaGE, KröberS, McWhorterML, RossollW, MüllerS, et al. (2008) Plastin 3 is a protective modifier of autosomal recessive spinal muscular atrophy. Science 320: 524–527.

36. KaplanBS, KaplanP, de ChadarevianJP, JequierS, O'ReganS, et al. (1988) Variable expression of autosomal recessive polycystic kidney disease and congenital hepatic fibrosis within a family. American journal of medical genetics 29: 639–647.

37. MacArthurDG, BalasubramanianS, FrankishA, HuangN, MorrisJ, et al. (2012) A systematic survey of loss-of-function variants in human protein-coding genes. Science 335: 823–828.

38. GoecksJ, NekrutenkoA, TaylorJ, GalaxyT (2010) Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol 11: R86.

39. GiardineB, RiemerC, HardisonRC, BurhansR, ElnitskiL, et al. (2005) Galaxy: a platform for interactive large-scale genome analysis. Genome Res 15: 1451–1455.

40. LiuX, JianX, BoerwinkleE (2011) dbNSFP: A lightweight database of human nonsynonymous SNPs and their functional predictions. Human mutation 32: 894–899.

41. LiH, DurbinR (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25: 1754–1760.

42. McKennaA, HannaM, BanksE, SivachenkoA, CibulskisK, et al. (2010) The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20: 1297–1303.

43. Davis J, Goadrich M (2006) The Relationship Between Precision-Recall and ROC Curves. 23rd International Conference on Machine Learning (ICML). Pittsburgh, PA, USA.

Štítky
Genetika Reprodukčná medicína

Článok vyšiel v časopise

PLOS Genetics


2013 Číslo 1
Najčítanejšie tento týždeň
Najčítanejšie v tomto čísle
Kurzy

Zvýšte si kvalifikáciu online z pohodlia domova

Aktuální možnosti diagnostiky a léčby litiáz
nový kurz
Autori: MUDr. Tomáš Ürge, PhD.

Všetky kurzy
Prihlásenie
Zabudnuté heslo

Zadajte e-mailovú adresu, s ktorou ste vytvárali účet. Budú Vám na ňu zasielané informácie k nastaveniu nového hesla.

Prihlásenie

Nemáte účet?  Registrujte sa

#ADS_BOTTOM_SCRIPTS#