利用深度学习生成蛋白质结构以发现通路

IF 5.5 1区 化学 Q2 CHEMISTRY, PHYSICAL
Konstantia Georgouli, Robert R. Stephany, Jeremy O. B. Tempkin, Claudio Santiago, Fikret Aydin, Mark A. Heimann, Loïc Pottier, Xiaohua Zhang, Timothy S. Carpenter, Tim Hsu, Dwight V. Nissley, Frederick H. Streitz, Felice C. Lightstone, Helgi I. Ingolfsson, Peer-Timo Bremer
{"title":"利用深度学习生成蛋白质结构以发现通路","authors":"Konstantia Georgouli, Robert R. Stephany, Jeremy O. B. Tempkin, Claudio Santiago, Fikret Aydin, Mark A. Heimann, Loïc Pottier, Xiaohua Zhang, Timothy S. Carpenter, Tim Hsu, Dwight V. Nissley, Frederick H. Streitz, Felice C. Lightstone, Helgi I. Ingolfsson, Peer-Timo Bremer","doi":"10.1021/acs.jctc.4c00816","DOIUrl":null,"url":null,"abstract":"Resolving the intricate details of biological phenomena at the molecular level is fundamentally limited by both length- and time scales that can be probed experimentally. Molecular dynamics (MD) simulations at various scales are powerful tools frequently employed to offer valuable biological insights beyond experimental resolution. However, while it is relatively simple to observe long-lived, stable configurations of, for example, proteins, at the required spatial resolution, simulating the more interesting rare transitions between such states often takes orders of magnitude longer than what is feasible even on the largest supercomputers available today. One common aspect of this challenge is pathway discovery, where the start and end states of a scientific phenomenon are known or can be approximated, but the mechanistic details in between are unknown. Here, we propose a representation-learning-based solution that uses interpolation and extrapolation in an abstract representation space to synthesize potential transition states, which are automatically validated using MD simulations. The new simulations of the synthesized transition states are subsequently incorporated into the representation learning, leading to an iterative framework for targeted path sampling. Our approach is demonstrated by recovering the transition of a RAS-RAF protein domain (CRD) from membrane-free to interacting with the membrane using coarse-grain MD simulations.","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"78 1","pages":""},"PeriodicalIF":5.5000,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Generating Protein Structures for Pathway Discovery Using Deep Learning\",\"authors\":\"Konstantia Georgouli, Robert R. Stephany, Jeremy O. B. Tempkin, Claudio Santiago, Fikret Aydin, Mark A. Heimann, Loïc Pottier, Xiaohua Zhang, Timothy S. Carpenter, Tim Hsu, Dwight V. Nissley, Frederick H. Streitz, Felice C. Lightstone, Helgi I. Ingolfsson, Peer-Timo Bremer\",\"doi\":\"10.1021/acs.jctc.4c00816\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Resolving the intricate details of biological phenomena at the molecular level is fundamentally limited by both length- and time scales that can be probed experimentally. Molecular dynamics (MD) simulations at various scales are powerful tools frequently employed to offer valuable biological insights beyond experimental resolution. However, while it is relatively simple to observe long-lived, stable configurations of, for example, proteins, at the required spatial resolution, simulating the more interesting rare transitions between such states often takes orders of magnitude longer than what is feasible even on the largest supercomputers available today. One common aspect of this challenge is pathway discovery, where the start and end states of a scientific phenomenon are known or can be approximated, but the mechanistic details in between are unknown. Here, we propose a representation-learning-based solution that uses interpolation and extrapolation in an abstract representation space to synthesize potential transition states, which are automatically validated using MD simulations. The new simulations of the synthesized transition states are subsequently incorporated into the representation learning, leading to an iterative framework for targeted path sampling. Our approach is demonstrated by recovering the transition of a RAS-RAF protein domain (CRD) from membrane-free to interacting with the membrane using coarse-grain MD simulations.\",\"PeriodicalId\":45,\"journal\":{\"name\":\"Journal of Chemical Theory and Computation\",\"volume\":\"78 1\",\"pages\":\"\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2024-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemical Theory and Computation\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.jctc.4c00816\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Theory and Computation","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jctc.4c00816","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

摘要

从根本上说,在分子水平上解决生物现象的复杂细节问题,受到实验探究的长度和时间尺度的限制。各种尺度的分子动力学(MD)模拟是一种功能强大的工具,经常被用来提供超出实验分辨率的有价值的生物学见解。然而,虽然在所需的空间分辨率下观察蛋白质等的长期稳定构型相对简单,但模拟这些状态之间更有趣的罕见转变所需的时间往往比当今最大的超级计算机所能处理的时间长几个数量级。这一挑战的一个常见方面是路径发现,在这种情况下,科学现象的起始和结束状态是已知的,或者可以近似得到,但两者之间的机理细节却是未知的。在这里,我们提出了一种基于表征学习的解决方案,利用抽象表征空间中的内插法和外推法合成潜在的过渡状态,并通过 MD 模拟自动进行验证。合成的过渡状态的新模拟随后被纳入表征学习,从而形成了一个目标路径采样的迭代框架。我们的方法通过使用粗粒度 MD 模拟恢复 RAS-RAF 蛋白结构域 (CRD) 从无膜到与膜相互作用的转变过程进行了演示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Generating Protein Structures for Pathway Discovery Using Deep Learning

Generating Protein Structures for Pathway Discovery Using Deep Learning
Resolving the intricate details of biological phenomena at the molecular level is fundamentally limited by both length- and time scales that can be probed experimentally. Molecular dynamics (MD) simulations at various scales are powerful tools frequently employed to offer valuable biological insights beyond experimental resolution. However, while it is relatively simple to observe long-lived, stable configurations of, for example, proteins, at the required spatial resolution, simulating the more interesting rare transitions between such states often takes orders of magnitude longer than what is feasible even on the largest supercomputers available today. One common aspect of this challenge is pathway discovery, where the start and end states of a scientific phenomenon are known or can be approximated, but the mechanistic details in between are unknown. Here, we propose a representation-learning-based solution that uses interpolation and extrapolation in an abstract representation space to synthesize potential transition states, which are automatically validated using MD simulations. The new simulations of the synthesized transition states are subsequently incorporated into the representation learning, leading to an iterative framework for targeted path sampling. Our approach is demonstrated by recovering the transition of a RAS-RAF protein domain (CRD) from membrane-free to interacting with the membrane using coarse-grain MD simulations.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Chemical Theory and Computation
Journal of Chemical Theory and Computation 化学-物理:原子、分子和化学物理
CiteScore
9.90
自引率
16.40%
发文量
568
审稿时长
1 months
期刊介绍: The Journal of Chemical Theory and Computation invites new and original contributions with the understanding that, if accepted, they will not be published elsewhere. Papers reporting new theories, methodology, and/or important applications in quantum electronic structure, molecular dynamics, and statistical mechanics are appropriate for submission to this Journal. Specific topics include advances in or applications of ab initio quantum mechanics, density functional theory, design and properties of new materials, surface science, Monte Carlo simulations, solvation models, QM/MM calculations, biomolecular structure prediction, and molecular dynamics in the broadest sense including gas-phase dynamics, ab initio dynamics, biomolecular dynamics, and protein folding. The Journal does not consider papers that are straightforward applications of known methods including DFT and molecular dynamics. The Journal favors submissions that include advances in theory or methodology with applications to compelling problems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信