The Use of Orthologous Sequences to Predict the Impact of Amino Acid Substitutions on Protein Function
Computational predictions of the functional impact of genetic variation play a critical role in human genetics research. For nonsynonymous coding variants, most prediction algorithms make use of patterns of amino acid substitutions observed among homologous proteins at a given site. In particular, substitutions observed in orthologous proteins from other species are often assumed to be tolerated in the human protein as well. We examined this assumption by evaluating a panel of nonsynonymous mutants of a prototypical human enzyme, methylenetetrahydrofolate reductase (MTHFR), in a yeast cell-based functional assay. As expected, substitutions in human MTHFR at sites that are well-conserved across distant orthologs result in an impaired enzyme, while substitutions present in recently diverged sequences (including a 9-site mutant that “resurrects” the human-macaque ancestor) result in a functional enzyme. We also interrogated 30 sites with varying degrees of conservation by creating substitutions in the human enzyme that are accepted in at least one ortholog of MTHFR. Quite surprisingly, most of these substitutions were deleterious to the human enzyme. The results suggest that selective constraints vary between phylogenetic lineages such that inclusion of distant orthologs to infer selective pressures on the human enzyme may be misleading. We propose that homologous proteins are best used to reconstruct ancestral sequences and infer amino acid conservation among only direct lineal ancestors of a particular protein. We show that such an “ancestral site preservation” measure outperforms other prediction methods, not only in our selected set for MTHFR, but also in an exhaustive set of E. coli LacI mutants.
Vyšlo v časopise:
The Use of Orthologous Sequences to Predict the Impact of Amino Acid Substitutions on Protein Function. PLoS Genet 6(5): e32767. doi:10.1371/journal.pgen.1000968
Kategorie:
Research Article
prolekare.web.journal.doi_sk:
https://doi.org/10.1371/journal.pgen.1000968
Souhrn
Computational predictions of the functional impact of genetic variation play a critical role in human genetics research. For nonsynonymous coding variants, most prediction algorithms make use of patterns of amino acid substitutions observed among homologous proteins at a given site. In particular, substitutions observed in orthologous proteins from other species are often assumed to be tolerated in the human protein as well. We examined this assumption by evaluating a panel of nonsynonymous mutants of a prototypical human enzyme, methylenetetrahydrofolate reductase (MTHFR), in a yeast cell-based functional assay. As expected, substitutions in human MTHFR at sites that are well-conserved across distant orthologs result in an impaired enzyme, while substitutions present in recently diverged sequences (including a 9-site mutant that “resurrects” the human-macaque ancestor) result in a functional enzyme. We also interrogated 30 sites with varying degrees of conservation by creating substitutions in the human enzyme that are accepted in at least one ortholog of MTHFR. Quite surprisingly, most of these substitutions were deleterious to the human enzyme. The results suggest that selective constraints vary between phylogenetic lineages such that inclusion of distant orthologs to infer selective pressures on the human enzyme may be misleading. We propose that homologous proteins are best used to reconstruct ancestral sequences and infer amino acid conservation among only direct lineal ancestors of a particular protein. We show that such an “ancestral site preservation” measure outperforms other prediction methods, not only in our selected set for MTHFR, but also in an exhaustive set of E. coli LacI mutants.
Zdroje
1. StensonPD
MortM
BallEV
HowellsK
PhillipsAD
2009 The Human Gene Mutation Database: 2008 update. Genome Med 1 13
2. NgPC
HenikoffS
2006 Predicting the effects of amino acid substitutions on protein function. Annu Rev Genomics Hum Genet 7 61 80
3. KrawczakM
BallEV
CooperDN
1998 Neighboring-nucleotide effects on the rates of germ-line single-base-pair substitution in human genes. Am J Hum Genet 63 474 488
4. MillerMP
KumarS
2001 Understanding human disease mutations through the use of interspecific genetic variation. Hum Mol Genet 10 2319 2328
5. GranthamR
1974 Amino acid difference formula to help explain protein evolution. Science 185 862 864
6. ChasmanD
AdamsRM
2001 Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: structure-based assessment of amino acid variation. J Mol Biol 307 683 706
7. WangZ
MoultJ
2001 SNPs, protein structure, and disease. Hum Mutat 17 263 270
8. SunyaevS
RamenskyV
KochI
LatheW3rd
KondrashovAS
2001 Prediction of deleterious human alleles. Hum Mol Genet 10 591 597
9. NgPC
HenikoffS
2002 Accounting for human polymorphisms predicted to affect protein function. Genome Res 12 436 446
10. ThomasPD
CampbellMJ
KejariwalA
MiH
KarlakB
2003 PANTHER: a library of protein families and subfamilies indexed by function. Genome Res 13 2129 2141
11. StoneEA
SidowA
2005 Physicochemical constraint violation by missense substitutions mediates impairment of protein function and disease severity. Genome Res 15 978 986
12. Ferrer-CostaC
GelpiJL
ZamakolaL
ParragaI
de la CruzX
2005 PMUT: a web-based tool for the annotation of pathological mutations on proteins. Bioinformatics 21 3176 3178
13. DayhoffMO
1969 Computer analysis of protein evolution. Sci Am 221 86 95
14. MiyataT
MiyazawaS
YasunagaT
1979 Two types of amino acid substitutions in protein evolution. J Mol Evol 12 219 236
15. ShuY
LeabmanMK
FengB
MangraviteLM
HuangCC
2003 Evolutionary conservation predicts function of variants of the human organic cation transporter, OCT1. Proc Natl Acad Sci U S A 100 5902 5907
16. RozenR
1997 Genetic predisposition to hyperhomocysteinemia: deficiency of methylenetetrahydrofolate reductase (MTHFR). Thromb Haemost 78 523 526
17. SelhubJ
1999 Homocysteine metabolism. Annu Rev Nutr 19 217 246
18. FrosstP
BlomHJ
MilosR
GoyetteP
SheppardCA
1995 A candidate genetic risk factor for vascular disease: a common mutation in methylenetetrahydrofolate reductase. Nat Genet 10 111 113
19. WaldDS
LawM
MorrisJK
2002 Homocysteine and cardiovascular disease: evidence on causality from a meta-analysis. Bmj 325 1202
20. van der LindenIJ
AfmanLA
HeilSG
BlomHJ
2006 Genetic variation in genes of folate metabolism and neural-tube defect risk. Proc Nutr Soc 65 204 215
21. MariniNJ
GinJ
ZiegleJ
KehoKH
GinzingerD
2008 The prevalence of folate-remedial MTHFR enzyme variants in humans. Proc Natl Acad Sci U S A 105 8055 8060
22. MartinYN
SalavaggioneOE
EckloffBW
WiebenED
SchaidDJ
2006 Human methylenetetrahydrofolate reductase pharmacogenomics: gene resequencing and functional genomics. Pharmacogenet Genomics 16 265 277
23. ShanX
WangL
HoffmasterR
KrugerWD
1999 Functional characterization of human methylenetetrahydrofolate reductase in Saccharomyces cerevisiae. J Biol Chem 274 32613 32618
24. DunnettCW
1964 New tables for Multiple Comparisons with a Control. Biometrics 20 482 491
25. BenjaminiY
HochbergY
1995 Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Statist Soc B 57 289 300
26. KatohK
TohH
2008 Recent developments in the MAFFT multiple sequence alignment program. Brief Bioinform 9 286 298
27. YangZ
2007 PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol 24 1586 1591
28. SaitouN
NeiM
1987 The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol 4 406 425
29. FitchWM
1970 Distinguishing homologous from analogous proteins. Syst Zool 19 99 113
30. RaymondRK
KastanosEK
ApplingDR
1999 Saccharomyces cerevisiae expresses two genes encoding isozymes of methylenetetrahydrofolate reductase. Arch Biochem Biophys 372 300 308
31. NaulaN
WaltherC
BaumannD
SchweingruberME
2002 Two non-complementing genes encoding enzymatically active methylenetetrahydrofolate reductases control methionine requirement in fission yeast Schizosaccharomyces pombe. Yeast 19 841 848
32. NgPC
HenikoffS
2003 SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res 31 3812 3814
33. MarkiewiczP
KleinaLG
CruzC
EhretS
MillerJH
1994 Genetic studies of the lac repressor. XIV. Analysis of 4000 altered Escherichia coli lac repressors reveals essential and non-essential residues, as well as “spacers” which do not require a specific sequence. J Mol Biol 240 421 433
34. SuckowJ
MarkiewiczP
KleinaLG
MillerJ
Kisters-WoikeB
1996 Genetic studies of the Lac repressor. XV: 4000 single amino acid substitutions and analysis of the resulting phenotypes on the basis of the protein structure. J Mol Biol 261 509 523
35. NgPC
HenikoffS
2001 Predicting deleterious amino acid substitutions. Genome Res 11 863 874
36. LiaoBY
ZhangJ
2008 Null mutations in human and mouse orthologs frequently result in different phenotypes. Proc Natl Acad Sci U S A 105 6987 6992
37. KondrashovAS
SunyaevS
KondrashovFA
2002 Dobzhansky-Muller incompatibilities in protein evolution. Proc Natl Acad Sci U S A 99 14878 14883
38. GaoL
ZhangJ
2003 Why are some human disease-associated mutations fixed in mice? Trends Genet 19 678 681
39. KulathinalRJ
BettencourtBR
HartlDL
2004 Compensated deleterious mutations in insect genomes. Science 306 1553 1554
40. ZhangJ
RosenbergHF
2002 Complementary advantageous substitutions in the evolution of an antiviral RNase of higher primates. Proc Natl Acad Sci U S A 99 5486 5491
41. BridghamJT
CarrollSM
ThorntonJW
2006 Evolution of hormone-receptor complexity by molecular exploitation. Science 312 97 101
42. WeinreichDM
DelaneyNF
DepristoMA
HartlDL
2006 Darwinian evolution can follow only very few mutational paths to fitter proteins. Science 312 111 114
43. ClarkAG
GlanowskiS
NielsenR
ThomasPD
KejariwalA
2003 Inferring nonneutral evolution from human-chimp-mouse orthologous gene trios. Science 302 1960 1963
44. KondrashovAS
PovolotskayaIS
IvankovDN
KondrashovFA
Rate of sequence divergence under constant selection. Biol Direct 5 5
Štítky
Genetika Reprodukčná medicínaČlánok vyšiel v časopise
PLOS Genetics
2010 Číslo 5
- Je „freeze-all“ pro všechny? Odborníci na fertilitu diskutovali na virtuálním summitu
- Gynekologové a odborníci na reprodukční medicínu se sejdou na prvním virtuálním summitu
Najčítanejšie v tomto čísle
- Common Genetic Variants near the Brittle Cornea Syndrome Locus Influence the Blinding Disease Risk Factor Central Corneal Thickness
- All About Mitochondrial Eve: An Interview with Rebecca Cann
- The Relationship among Gene Expression, the Evolution of Gene Dosage, and the Rate of Protein Evolution
- SMA-10/LRIG Is a Conserved Transmembrane Protein that Enhances Bone Morphogenetic Protein Signaling