Prediction of phase-separation propensities of disordered proteins from sequence

IF 9.1 1区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Sören von Bülow, Giulio Tesei, Fatima Kamal Zaidi, Tanja Mittag, Kresten Lindorff-Larsen
{"title":"Prediction of phase-separation propensities of disordered proteins from sequence","authors":"Sören von Bülow, Giulio Tesei, Fatima Kamal Zaidi, Tanja Mittag, Kresten Lindorff-Larsen","doi":"10.1073/pnas.2417920122","DOIUrl":null,"url":null,"abstract":"Phase separation is one possible mechanism governing the selective cellular enrichment of biomolecular constituents for processes such as transcriptional activation, mRNA regulation, and immune signaling. Phase separation is mediated by multivalent interactions of macromolecules including intrinsically disordered proteins and regions (IDRs). Despite considerable advances in experiments, theory, and simulations, the prediction of the thermodynamics of IDR phase behavior remains challenging. We combined coarse-grained molecular dynamics simulations and active learning to develop a fast and accurate machine learning model to predict the free energy and saturation concentration for phase separation directly from sequence. We validate the model using computational and previously measured experimental data, as well as new experimental data for six proteins. We apply our model to all 27,663 IDRs of chain length up to 800 residues in the human proteome and find that 1,420 of these (5%) are predicted to undergo homotypic phase separation with transfer free energies &lt; −2 <jats:italic>k</jats:italic> <jats:sub>B</jats:sub> <jats:italic>T</jats:italic> . We use our model to understand the relationship between single-chain compaction and phase separation and find that changes from charge- to hydrophobicity-mediated interactions can break the symmetry between intra- and intermolecular interactions. We also provide proof of principle for how the model can be used in force field refinement. Our work refines and quantifies the established rules governing the connection between sequence features and phase-separation propensities, and our prediction models will be useful for interpreting and designing cellular experiments on the role of phase separation, and for the design of IDRs with specific phase-separation propensities.","PeriodicalId":20548,"journal":{"name":"Proceedings of the National Academy of Sciences of the United States of America","volume":"18 1","pages":""},"PeriodicalIF":9.1000,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the National Academy of Sciences of the United States of America","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1073/pnas.2417920122","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Phase separation is one possible mechanism governing the selective cellular enrichment of biomolecular constituents for processes such as transcriptional activation, mRNA regulation, and immune signaling. Phase separation is mediated by multivalent interactions of macromolecules including intrinsically disordered proteins and regions (IDRs). Despite considerable advances in experiments, theory, and simulations, the prediction of the thermodynamics of IDR phase behavior remains challenging. We combined coarse-grained molecular dynamics simulations and active learning to develop a fast and accurate machine learning model to predict the free energy and saturation concentration for phase separation directly from sequence. We validate the model using computational and previously measured experimental data, as well as new experimental data for six proteins. We apply our model to all 27,663 IDRs of chain length up to 800 residues in the human proteome and find that 1,420 of these (5%) are predicted to undergo homotypic phase separation with transfer free energies < −2 k B T . We use our model to understand the relationship between single-chain compaction and phase separation and find that changes from charge- to hydrophobicity-mediated interactions can break the symmetry between intra- and intermolecular interactions. We also provide proof of principle for how the model can be used in force field refinement. Our work refines and quantifies the established rules governing the connection between sequence features and phase-separation propensities, and our prediction models will be useful for interpreting and designing cellular experiments on the role of phase separation, and for the design of IDRs with specific phase-separation propensities.
序列中无序蛋白相分离倾向的预测
相分离是调控生物分子组分选择性细胞富集的一种可能机制,可用于转录激活、mRNA调控和免疫信号传导等过程。相分离是由大分子的多价相互作用介导的,包括内在无序的蛋白质和区域(IDRs)。尽管在实验、理论和模拟方面取得了相当大的进展,但IDR相行为的热力学预测仍然具有挑战性。我们将粗粒度分子动力学模拟与主动学习相结合,建立了一个快速准确的机器学习模型,直接从序列中预测相分离的自由能和饱和浓度。我们使用计算和先前测量的实验数据以及六种蛋白质的新实验数据验证了该模型。我们将我们的模型应用于人类蛋白质组中所有27,663个链长不超过800个残基的idr,发现其中1,420个(5%)预计会发生具有传递自由能的同型相分离;−2ktb。我们使用我们的模型来理解单链压实和相分离之间的关系,并发现从电荷介导的相互作用到疏水介导的相互作用的变化可以打破分子内和分子间相互作用之间的对称性。我们还提供了如何将该模型用于力场细化的原理证明。我们的工作细化和量化了控制序列特征和相分离倾向之间联系的既定规则,我们的预测模型将有助于解释和设计有关相分离作用的细胞实验,以及设计具有特定相分离倾向的idr。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
19.00
自引率
0.90%
发文量
3575
审稿时长
2.5 months
期刊介绍: The Proceedings of the National Academy of Sciences (PNAS), a peer-reviewed journal of the National Academy of Sciences (NAS), serves as an authoritative source for high-impact, original research across the biological, physical, and social sciences. With a global scope, the journal welcomes submissions from researchers worldwide, making it an inclusive platform for advancing scientific knowledge.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信