Forecasting protein evolution by integrating birth-death population models with structurally constrained substitution models.

IF 6.4 1区 生物学 Q1 BIOLOGY
eLife Pub Date : 2025-09-24 DOI:10.7554/eLife.106365
David Ferreiro, Luis Daniel González-Vázquez, Ana Prado-Comesaña, Miguel Arenas
{"title":"Forecasting protein evolution by integrating birth-death population models with structurally constrained substitution models.","authors":"David Ferreiro, Luis Daniel González-Vázquez, Ana Prado-Comesaña, Miguel Arenas","doi":"10.7554/eLife.106365","DOIUrl":null,"url":null,"abstract":"<p><p>Evolutionary studies in population genetics and ecology were mainly focused on predicting and understanding past evolutionary events. Recently, however, a growing trend explores the prediction of evolutionary trajectories toward the future promoted by its wide variety of applications. In this context, we introduce a forecasting protein evolution method that integrates birth-death population models with substitution models that consider selection on protein folding stability. In contrast to traditional population genetics methods that usually make the unrealistic assumption of simulating molecular evolution separately from the evolutionary history, the present method combines both processes to simultaneously model forward-in-time birth-death evolutionary trajectories and protein evolution under structurally constrained substitution models that outperformed traditional empirical substitution models. We implemented the method into a freely available computer framework. We evaluated the accuracy of the predictions with several monitored viral proteins of broad interest. Overall, the method showed acceptable errors in predicting the folding stability of the forecasted protein variants, but, expectedly, the errors were larger in the prediction of the corresponding sequences. We conclude that forecasting protein evolution is feasible in certain evolutionary scenarios and provide suggestions to enhance its accuracy by improving the underlying models of evolution.</p>","PeriodicalId":11640,"journal":{"name":"eLife","volume":"14 ","pages":""},"PeriodicalIF":6.4000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12459951/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"eLife","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.7554/eLife.106365","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Evolutionary studies in population genetics and ecology were mainly focused on predicting and understanding past evolutionary events. Recently, however, a growing trend explores the prediction of evolutionary trajectories toward the future promoted by its wide variety of applications. In this context, we introduce a forecasting protein evolution method that integrates birth-death population models with substitution models that consider selection on protein folding stability. In contrast to traditional population genetics methods that usually make the unrealistic assumption of simulating molecular evolution separately from the evolutionary history, the present method combines both processes to simultaneously model forward-in-time birth-death evolutionary trajectories and protein evolution under structurally constrained substitution models that outperformed traditional empirical substitution models. We implemented the method into a freely available computer framework. We evaluated the accuracy of the predictions with several monitored viral proteins of broad interest. Overall, the method showed acceptable errors in predicting the folding stability of the forecasted protein variants, but, expectedly, the errors were larger in the prediction of the corresponding sequences. We conclude that forecasting protein evolution is feasible in certain evolutionary scenarios and provide suggestions to enhance its accuracy by improving the underlying models of evolution.

Abstract Image

Abstract Image

Abstract Image

结合结构约束替代模型的出生-死亡种群模型预测蛋白质进化。
种群遗传学和生态学的进化研究主要集中在预测和理解过去的进化事件。然而,最近,由于其广泛的应用,越来越多的趋势是探索对未来进化轨迹的预测。在这种背景下,我们引入了一种预测蛋白质进化的方法,该方法将出生-死亡群体模型与考虑蛋白质折叠稳定性选择的替代模型相结合。传统的群体遗传学方法通常将分子进化与进化历史分开模拟,这是不切实际的假设,与此相反,该方法结合了这两个过程,同时模拟了结构约束替代模型下的前向出生-死亡进化轨迹和蛋白质进化,优于传统的经验替代模型。我们将这种方法应用到一个免费的计算机框架中。我们用几种广泛关注的监测病毒蛋白来评估预测的准确性。总体而言,该方法在预测蛋白质变异的折叠稳定性方面显示出可接受的误差,但在预测相应序列时,预期的误差更大。我们得出结论,预测蛋白质进化在某些进化情景下是可行的,并提出了通过改进进化基础模型来提高其准确性的建议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
eLife
eLife BIOLOGY-
CiteScore
12.90
自引率
3.90%
发文量
3122
审稿时长
17 weeks
期刊介绍: eLife is a distinguished, not-for-profit, peer-reviewed open access scientific journal that specializes in the fields of biomedical and life sciences. eLife is known for its selective publication process, which includes a variety of article types such as: Research Articles: Detailed reports of original research findings. Short Reports: Concise presentations of significant findings that do not warrant a full-length research article. Tools and Resources: Descriptions of new tools, technologies, or resources that facilitate scientific research. Research Advances: Brief reports on significant scientific advancements that have immediate implications for the field. Scientific Correspondence: Short communications that comment on or provide additional information related to published articles. Review Articles: Comprehensive overviews of a specific topic or field within the life sciences.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信