基于单链性质优化的内在无序蛋白液-液相行为精确模型

Proceedings of the National Academy of Sciences Pub Date : 2021-06-23 DOI:10.1101/2021.06.23.449550

G. Tesei, Thea K. Schulze, R. Crehuet, K. Lindorff-Larsen

{"title":"基于单链性质优化的内在无序蛋白液-液相行为精确模型","authors":"G. Tesei, Thea K. Schulze, R. Crehuet, K. Lindorff-Larsen","doi":"10.1101/2021.06.23.449550","DOIUrl":null,"url":null,"abstract":"Significance Cells may compartmentalize proteins via a demixing process known as liquid–liquid phase separation (LLPS), which is often driven by intrinsically disordered proteins (IDPs) and regions. Protein condensates arising from LLPS may develop into insoluble protein aggregates, as in neurodegenerative diseases and cancer. Understanding the process of formation, dissolution, and aging of protein condensates requires models that accurately capture the underpinning interactions at the residue level. In this work, we leverage data from biophysical experiments on IDPs in dilute solution to develop a sequence-dependent model which predicts conformational and phase behavior of diverse and unrelated protein sequences with good accuracy. Using the model, we gain insight into the coupling between chain compaction and LLPS propensity. Many intrinsically disordered proteins (IDPs) may undergo liquid–liquid phase separation (LLPS) and participate in the formation of membraneless organelles in the cell, thereby contributing to the regulation and compartmentalization of intracellular biochemical reactions. The phase behavior of IDPs is sequence dependent, and its investigation through molecular simulations requires protein models that combine computational efficiency with an accurate description of intramolecular and intermolecular interactions. We developed a general coarse-grained model of IDPs, with residue-level detail, based on an extensive set of experimental data on single-chain properties. Ensemble-averaged experimental observables are predicted from molecular simulations, and a data-driven parameter-learning procedure is used to identify the residue-specific model parameters that minimize the discrepancy between predictions and experiments. The model accurately reproduces the experimentally observed conformational propensities of a set of IDPs. Through two-body as well as large-scale molecular simulations, we show that the optimization of the intramolecular interactions results in improved predictions of protein self-association and LLPS.","PeriodicalId":20595,"journal":{"name":"Proceedings of the National Academy of Sciences","volume":"31 13 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"105","resultStr":"{\"title\":\"Accurate model of liquid–liquid phase behavior of intrinsically disordered proteins from optimization of single-chain properties\",\"authors\":\"G. Tesei, Thea K. Schulze, R. Crehuet, K. Lindorff-Larsen\",\"doi\":\"10.1101/2021.06.23.449550\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Significance Cells may compartmentalize proteins via a demixing process known as liquid–liquid phase separation (LLPS), which is often driven by intrinsically disordered proteins (IDPs) and regions. Protein condensates arising from LLPS may develop into insoluble protein aggregates, as in neurodegenerative diseases and cancer. Understanding the process of formation, dissolution, and aging of protein condensates requires models that accurately capture the underpinning interactions at the residue level. In this work, we leverage data from biophysical experiments on IDPs in dilute solution to develop a sequence-dependent model which predicts conformational and phase behavior of diverse and unrelated protein sequences with good accuracy. Using the model, we gain insight into the coupling between chain compaction and LLPS propensity. Many intrinsically disordered proteins (IDPs) may undergo liquid–liquid phase separation (LLPS) and participate in the formation of membraneless organelles in the cell, thereby contributing to the regulation and compartmentalization of intracellular biochemical reactions. The phase behavior of IDPs is sequence dependent, and its investigation through molecular simulations requires protein models that combine computational efficiency with an accurate description of intramolecular and intermolecular interactions. We developed a general coarse-grained model of IDPs, with residue-level detail, based on an extensive set of experimental data on single-chain properties. Ensemble-averaged experimental observables are predicted from molecular simulations, and a data-driven parameter-learning procedure is used to identify the residue-specific model parameters that minimize the discrepancy between predictions and experiments. The model accurately reproduces the experimentally observed conformational propensities of a set of IDPs. Through two-body as well as large-scale molecular simulations, we show that the optimization of the intramolecular interactions results in improved predictions of protein self-association and LLPS.\",\"PeriodicalId\":20595,\"journal\":{\"name\":\"Proceedings of the National Academy of Sciences\",\"volume\":\"31 13 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"105\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the National Academy of Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2021.06.23.449550\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the National Academy of Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2021.06.23.449550","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 105

摘要

细胞可以通过被称为液-液相分离(LLPS)的分离过程将蛋白质区隔，这通常是由内在无序蛋白(IDPs)和区域驱动的。在神经退行性疾病和癌症中，由LLPS产生的蛋白质凝聚物可能发展成不溶性蛋白质聚集体。要了解蛋白质凝聚物的形成、溶解和老化过程，就需要能够准确捕捉残留物水平上基础相互作用的模型。在这项工作中，我们利用来自稀释溶液中IDPs生物物理实验的数据来开发一个序列依赖模型，该模型可以很准确地预测多种不相关蛋白质序列的构象和相行为。利用该模型，我们深入了解了链压实和LLPS倾向之间的耦合。许多内在无序蛋白(IDPs)可能发生液-液相分离(LLPS)并参与细胞内无膜细胞器的形成，从而参与细胞内生化反应的调控和区隔化。IDPs的相行为依赖于序列，通过分子模拟对其进行研究需要将计算效率与分子内和分子间相互作用的准确描述相结合的蛋白质模型。基于单链性质的大量实验数据，我们开发了一个通用的粗粒度IDPs模型，具有残留级细节。从分子模拟中预测集合平均实验观测值，并使用数据驱动的参数学习过程来识别残差特定模型参数，以最大限度地减少预测与实验之间的差异。该模型准确地再现了实验观察到的一组IDPs的构象倾向。通过两体和大规模分子模拟，我们表明优化分子内相互作用可以改善蛋白质自结合和LLPS的预测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Accurate model of liquid–liquid phase behavior of intrinsically disordered proteins from optimization of single-chain properties

Significance Cells may compartmentalize proteins via a demixing process known as liquid–liquid phase separation (LLPS), which is often driven by intrinsically disordered proteins (IDPs) and regions. Protein condensates arising from LLPS may develop into insoluble protein aggregates, as in neurodegenerative diseases and cancer. Understanding the process of formation, dissolution, and aging of protein condensates requires models that accurately capture the underpinning interactions at the residue level. In this work, we leverage data from biophysical experiments on IDPs in dilute solution to develop a sequence-dependent model which predicts conformational and phase behavior of diverse and unrelated protein sequences with good accuracy. Using the model, we gain insight into the coupling between chain compaction and LLPS propensity. Many intrinsically disordered proteins (IDPs) may undergo liquid–liquid phase separation (LLPS) and participate in the formation of membraneless organelles in the cell, thereby contributing to the regulation and compartmentalization of intracellular biochemical reactions. The phase behavior of IDPs is sequence dependent, and its investigation through molecular simulations requires protein models that combine computational efficiency with an accurate description of intramolecular and intermolecular interactions. We developed a general coarse-grained model of IDPs, with residue-level detail, based on an extensive set of experimental data on single-chain properties. Ensemble-averaged experimental observables are predicted from molecular simulations, and a data-driven parameter-learning procedure is used to identify the residue-specific model parameters that minimize the discrepancy between predictions and experiments. The model accurately reproduces the experimentally observed conformational propensities of a set of IDPs. Through two-body as well as large-scale molecular simulations, we show that the optimization of the intramolecular interactions results in improved predictions of protein self-association and LLPS.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the National Academy of Sciences

自引率

0.00%

发文量