Fluctuations and the limit of predictability in protein evolution.

Saverio Rossi, Leonardo Di Bari, Martin Weigt, Francesco Zamponi
{"title":"Fluctuations and the limit of predictability in protein evolution.","authors":"Saverio Rossi, Leonardo Di Bari, Martin Weigt, Francesco Zamponi","doi":"10.1088/1361-6633/adea92","DOIUrl":null,"url":null,"abstract":"<p><p>Protein evolution involves mutations occurring across a wide range of time scales. In analogy with disordered systems in statistical physics, this dynamical heterogeneity suggests strong correlations between mutations happening at distinct sites and times. To quantify these correlations, we examine the role of various fluctuation sources in protein evolution, simulated using a data-driven energy landscape as a proxy for protein fitness. By applying spatio-temporal correlation functions developed in the context of disordered physical systems, we disentangle fluctuations originating from the initial condition, i.e. the ancestral sequence from which the evolutionary process originated, from those driven by stochastic mutations along independent evolutionary paths. Our analysis shows that, in diverse protein families, fluctuations from the ancestral sequence predominate at shorter time scales. This allows us to identify a time scale over which ancestral sequence information persists, enabling its reconstruction. We link this persistence to the strength of epistatic interactions: ancestral sequences with stronger epistatic signatures impact evolutionary trajectories over extended periods. At longer time scales, however, ancestral influence fades as epistatically constrained sites evolve collectively. To confirm this idea, we apply a standard ancestral sequence reconstruction (ASR) algorithm and verify that the time-dependent recovery error is influenced by the properties of the ancestor itself. Overall, our results reveal that the properties of ancestral sequences-particularly their epistatic constraints-influence the initial evolutionary dynamics and the performance of standard ASR algorithms.</p>","PeriodicalId":74666,"journal":{"name":"Reports on progress in physics. Physical Society (Great Britain)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reports on progress in physics. Physical Society (Great Britain)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/1361-6633/adea92","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Protein evolution involves mutations occurring across a wide range of time scales. In analogy with disordered systems in statistical physics, this dynamical heterogeneity suggests strong correlations between mutations happening at distinct sites and times. To quantify these correlations, we examine the role of various fluctuation sources in protein evolution, simulated using a data-driven energy landscape as a proxy for protein fitness. By applying spatio-temporal correlation functions developed in the context of disordered physical systems, we disentangle fluctuations originating from the initial condition, i.e. the ancestral sequence from which the evolutionary process originated, from those driven by stochastic mutations along independent evolutionary paths. Our analysis shows that, in diverse protein families, fluctuations from the ancestral sequence predominate at shorter time scales. This allows us to identify a time scale over which ancestral sequence information persists, enabling its reconstruction. We link this persistence to the strength of epistatic interactions: ancestral sequences with stronger epistatic signatures impact evolutionary trajectories over extended periods. At longer time scales, however, ancestral influence fades as epistatically constrained sites evolve collectively. To confirm this idea, we apply a standard ancestral sequence reconstruction (ASR) algorithm and verify that the time-dependent recovery error is influenced by the properties of the ancestor itself. Overall, our results reveal that the properties of ancestral sequences-particularly their epistatic constraints-influence the initial evolutionary dynamics and the performance of standard ASR algorithms.

蛋白质进化中的波动和可预测性的限制。
蛋白质进化涉及在很长时间尺度上发生的突变。与统计物理中的无序系统类似,这种动态异质性表明在不同地点和时间发生的突变之间存在很强的相关性。为了量化这些相关性,我们研究了各种波动源在蛋白质进化中的作用,使用数据驱动的能量景观作为蛋白质适合度的代理进行模拟。通过应用在无序物理系统背景下开发的时空相关函数,我们将源自初始条件(即进化过程起源的祖先序列)的波动与沿着独立进化路径的随机突变驱动的波动区分开来。我们的分析表明,在不同的蛋白质家族中,来自祖先序列的波动在较短的时间尺度上占主导地位。这使我们能够确定祖先序列信息持续存在的时间尺度,从而使其能够重建。我们将这种持久性与上位性相互作用的强度联系起来:具有更强上位性特征的祖先序列在很长一段时间内影响进化轨迹。然而,在更长的时间尺度上,祖先的影响随着上位性约束的地点集体进化而消失。为了证实这一想法,我们应用了一个标准的祖先序列重建算法,并验证了时间相关的恢复误差受到祖先本身属性的影响。总的来说,我们的研究结果表明,祖先序列的特性——特别是它们的上位性约束——影响了初始进化动力学和标准祖先序列重建算法的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信