#PAGE_PARAMS# #ADS_HEAD_SCRIPTS# #MICRODATA#

The Use of Orthologous Sequences to Predict the Impact of Amino Acid Substitutions on Protein Function


Computational predictions of the functional impact of genetic variation play a critical role in human genetics research. For nonsynonymous coding variants, most prediction algorithms make use of patterns of amino acid substitutions observed among homologous proteins at a given site. In particular, substitutions observed in orthologous proteins from other species are often assumed to be tolerated in the human protein as well. We examined this assumption by evaluating a panel of nonsynonymous mutants of a prototypical human enzyme, methylenetetrahydrofolate reductase (MTHFR), in a yeast cell-based functional assay. As expected, substitutions in human MTHFR at sites that are well-conserved across distant orthologs result in an impaired enzyme, while substitutions present in recently diverged sequences (including a 9-site mutant that “resurrects” the human-macaque ancestor) result in a functional enzyme. We also interrogated 30 sites with varying degrees of conservation by creating substitutions in the human enzyme that are accepted in at least one ortholog of MTHFR. Quite surprisingly, most of these substitutions were deleterious to the human enzyme. The results suggest that selective constraints vary between phylogenetic lineages such that inclusion of distant orthologs to infer selective pressures on the human enzyme may be misleading. We propose that homologous proteins are best used to reconstruct ancestral sequences and infer amino acid conservation among only direct lineal ancestors of a particular protein. We show that such an “ancestral site preservation” measure outperforms other prediction methods, not only in our selected set for MTHFR, but also in an exhaustive set of E. coli LacI mutants.


Vyšlo v časopise: The Use of Orthologous Sequences to Predict the Impact of Amino Acid Substitutions on Protein Function. PLoS Genet 6(5): e32767. doi:10.1371/journal.pgen.1000968
Kategorie: Research Article
prolekare.web.journal.doi_sk: https://doi.org/10.1371/journal.pgen.1000968

Souhrn

Computational predictions of the functional impact of genetic variation play a critical role in human genetics research. For nonsynonymous coding variants, most prediction algorithms make use of patterns of amino acid substitutions observed among homologous proteins at a given site. In particular, substitutions observed in orthologous proteins from other species are often assumed to be tolerated in the human protein as well. We examined this assumption by evaluating a panel of nonsynonymous mutants of a prototypical human enzyme, methylenetetrahydrofolate reductase (MTHFR), in a yeast cell-based functional assay. As expected, substitutions in human MTHFR at sites that are well-conserved across distant orthologs result in an impaired enzyme, while substitutions present in recently diverged sequences (including a 9-site mutant that “resurrects” the human-macaque ancestor) result in a functional enzyme. We also interrogated 30 sites with varying degrees of conservation by creating substitutions in the human enzyme that are accepted in at least one ortholog of MTHFR. Quite surprisingly, most of these substitutions were deleterious to the human enzyme. The results suggest that selective constraints vary between phylogenetic lineages such that inclusion of distant orthologs to infer selective pressures on the human enzyme may be misleading. We propose that homologous proteins are best used to reconstruct ancestral sequences and infer amino acid conservation among only direct lineal ancestors of a particular protein. We show that such an “ancestral site preservation” measure outperforms other prediction methods, not only in our selected set for MTHFR, but also in an exhaustive set of E. coli LacI mutants.


Zdroje

1. StensonPD

MortM

BallEV

HowellsK

PhillipsAD

2009 The Human Gene Mutation Database: 2008 update. Genome Med 1 13

2. NgPC

HenikoffS

2006 Predicting the effects of amino acid substitutions on protein function. Annu Rev Genomics Hum Genet 7 61 80

3. KrawczakM

BallEV

CooperDN

1998 Neighboring-nucleotide effects on the rates of germ-line single-base-pair substitution in human genes. Am J Hum Genet 63 474 488

4. MillerMP

KumarS

2001 Understanding human disease mutations through the use of interspecific genetic variation. Hum Mol Genet 10 2319 2328

5. GranthamR

1974 Amino acid difference formula to help explain protein evolution. Science 185 862 864

6. ChasmanD

AdamsRM

2001 Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: structure-based assessment of amino acid variation. J Mol Biol 307 683 706

7. WangZ

MoultJ

2001 SNPs, protein structure, and disease. Hum Mutat 17 263 270

8. SunyaevS

RamenskyV

KochI

LatheW3rd

KondrashovAS

2001 Prediction of deleterious human alleles. Hum Mol Genet 10 591 597

9. NgPC

HenikoffS

2002 Accounting for human polymorphisms predicted to affect protein function. Genome Res 12 436 446

10. ThomasPD

CampbellMJ

KejariwalA

MiH

KarlakB

2003 PANTHER: a library of protein families and subfamilies indexed by function. Genome Res 13 2129 2141

11. StoneEA

SidowA

2005 Physicochemical constraint violation by missense substitutions mediates impairment of protein function and disease severity. Genome Res 15 978 986

12. Ferrer-CostaC

GelpiJL

ZamakolaL

ParragaI

de la CruzX

2005 PMUT: a web-based tool for the annotation of pathological mutations on proteins. Bioinformatics 21 3176 3178

13. DayhoffMO

1969 Computer analysis of protein evolution. Sci Am 221 86 95

14. MiyataT

MiyazawaS

YasunagaT

1979 Two types of amino acid substitutions in protein evolution. J Mol Evol 12 219 236

15. ShuY

LeabmanMK

FengB

MangraviteLM

HuangCC

2003 Evolutionary conservation predicts function of variants of the human organic cation transporter, OCT1. Proc Natl Acad Sci U S A 100 5902 5907

16. RozenR

1997 Genetic predisposition to hyperhomocysteinemia: deficiency of methylenetetrahydrofolate reductase (MTHFR). Thromb Haemost 78 523 526

17. SelhubJ

1999 Homocysteine metabolism. Annu Rev Nutr 19 217 246

18. FrosstP

BlomHJ

MilosR

GoyetteP

SheppardCA

1995 A candidate genetic risk factor for vascular disease: a common mutation in methylenetetrahydrofolate reductase. Nat Genet 10 111 113

19. WaldDS

LawM

MorrisJK

2002 Homocysteine and cardiovascular disease: evidence on causality from a meta-analysis. Bmj 325 1202

20. van der LindenIJ

AfmanLA

HeilSG

BlomHJ

2006 Genetic variation in genes of folate metabolism and neural-tube defect risk. Proc Nutr Soc 65 204 215

21. MariniNJ

GinJ

ZiegleJ

KehoKH

GinzingerD

2008 The prevalence of folate-remedial MTHFR enzyme variants in humans. Proc Natl Acad Sci U S A 105 8055 8060

22. MartinYN

SalavaggioneOE

EckloffBW

WiebenED

SchaidDJ

2006 Human methylenetetrahydrofolate reductase pharmacogenomics: gene resequencing and functional genomics. Pharmacogenet Genomics 16 265 277

23. ShanX

WangL

HoffmasterR

KrugerWD

1999 Functional characterization of human methylenetetrahydrofolate reductase in Saccharomyces cerevisiae. J Biol Chem 274 32613 32618

24. DunnettCW

1964 New tables for Multiple Comparisons with a Control. Biometrics 20 482 491

25. BenjaminiY

HochbergY

1995 Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Statist Soc B 57 289 300

26. KatohK

TohH

2008 Recent developments in the MAFFT multiple sequence alignment program. Brief Bioinform 9 286 298

27. YangZ

2007 PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol 24 1586 1591

28. SaitouN

NeiM

1987 The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol 4 406 425

29. FitchWM

1970 Distinguishing homologous from analogous proteins. Syst Zool 19 99 113

30. RaymondRK

KastanosEK

ApplingDR

1999 Saccharomyces cerevisiae expresses two genes encoding isozymes of methylenetetrahydrofolate reductase. Arch Biochem Biophys 372 300 308

31. NaulaN

WaltherC

BaumannD

SchweingruberME

2002 Two non-complementing genes encoding enzymatically active methylenetetrahydrofolate reductases control methionine requirement in fission yeast Schizosaccharomyces pombe. Yeast 19 841 848

32. NgPC

HenikoffS

2003 SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res 31 3812 3814

33. MarkiewiczP

KleinaLG

CruzC

EhretS

MillerJH

1994 Genetic studies of the lac repressor. XIV. Analysis of 4000 altered Escherichia coli lac repressors reveals essential and non-essential residues, as well as “spacers” which do not require a specific sequence. J Mol Biol 240 421 433

34. SuckowJ

MarkiewiczP

KleinaLG

MillerJ

Kisters-WoikeB

1996 Genetic studies of the Lac repressor. XV: 4000 single amino acid substitutions and analysis of the resulting phenotypes on the basis of the protein structure. J Mol Biol 261 509 523

35. NgPC

HenikoffS

2001 Predicting deleterious amino acid substitutions. Genome Res 11 863 874

36. LiaoBY

ZhangJ

2008 Null mutations in human and mouse orthologs frequently result in different phenotypes. Proc Natl Acad Sci U S A 105 6987 6992

37. KondrashovAS

SunyaevS

KondrashovFA

2002 Dobzhansky-Muller incompatibilities in protein evolution. Proc Natl Acad Sci U S A 99 14878 14883

38. GaoL

ZhangJ

2003 Why are some human disease-associated mutations fixed in mice? Trends Genet 19 678 681

39. KulathinalRJ

BettencourtBR

HartlDL

2004 Compensated deleterious mutations in insect genomes. Science 306 1553 1554

40. ZhangJ

RosenbergHF

2002 Complementary advantageous substitutions in the evolution of an antiviral RNase of higher primates. Proc Natl Acad Sci U S A 99 5486 5491

41. BridghamJT

CarrollSM

ThorntonJW

2006 Evolution of hormone-receptor complexity by molecular exploitation. Science 312 97 101

42. WeinreichDM

DelaneyNF

DepristoMA

HartlDL

2006 Darwinian evolution can follow only very few mutational paths to fitter proteins. Science 312 111 114

43. ClarkAG

GlanowskiS

NielsenR

ThomasPD

KejariwalA

2003 Inferring nonneutral evolution from human-chimp-mouse orthologous gene trios. Science 302 1960 1963

44. KondrashovAS

PovolotskayaIS

IvankovDN

KondrashovFA

Rate of sequence divergence under constant selection. Biol Direct 5 5

Štítky
Genetika Reprodukčná medicína

Článok vyšiel v časopise

PLOS Genetics


2010 Číslo 5
Najčítanejšie tento týždeň
Najčítanejšie v tomto čísle
Kurzy

Zvýšte si kvalifikáciu online z pohodlia domova

Aktuální možnosti diagnostiky a léčby litiáz
nový kurz
Autori: MUDr. Tomáš Ürge, PhD.

Všetky kurzy
Prihlásenie
Zabudnuté heslo

Zadajte e-mailovú adresu, s ktorou ste vytvárali účet. Budú Vám na ňu zasielané informácie k nastaveniu nového hesla.

Prihlásenie

Nemáte účet?  Registrujte sa

#ADS_BOTTOM_SCRIPTS#