Bojun Liu, Siqin Cao, Jordan G Boysen, Mingyi Xue, Xuhui Huang
{"title":"Memory kernel minimization-based neural networks for discovering slow collective variables of biomolecular dynamics.","authors":"Bojun Liu, Siqin Cao, Jordan G Boysen, Mingyi Xue, Xuhui Huang","doi":"10.1038/s43588-025-00815-8","DOIUrl":null,"url":null,"abstract":"<p><p>Identifying collective variables (CVs) that accurately capture the slowest timescales of protein conformational changes is crucial to comprehend numerous biological processes. Here we introduce memory kernel minimization-based neural networks (MEMnets), a deep learning framework that accurately identifies the slow CVs of biomolecular dynamics. Unlike popular CV-identification methods, which typically assume Markovian dynamics, MEMnets is built on the integrative generalized master equation theory, which incorporates non-Markovian dynamics by encoding them in a memory kernel for continuous CVs. The key innovation of MEMnets is the identification of optimal CVs by minimizing the upper bound for the time-integrated memory kernels through parallel encoder networks. We demonstrate that MEMnets effectively identifies slow CVs involved in the folding of the FIP35 WW domain, revealing two parallel folding pathways. In addition, we illustrate MEMnets' robust numerical stability in identifying meaningful CVs in large biomolecular dynamic systems with limited sampling by applying it to the clamp opening of bacterial RNA polymerase, a much more complex conformational change.</p>","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":" ","pages":""},"PeriodicalIF":12.0000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature computational science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s43588-025-00815-8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Identifying collective variables (CVs) that accurately capture the slowest timescales of protein conformational changes is crucial to comprehend numerous biological processes. Here we introduce memory kernel minimization-based neural networks (MEMnets), a deep learning framework that accurately identifies the slow CVs of biomolecular dynamics. Unlike popular CV-identification methods, which typically assume Markovian dynamics, MEMnets is built on the integrative generalized master equation theory, which incorporates non-Markovian dynamics by encoding them in a memory kernel for continuous CVs. The key innovation of MEMnets is the identification of optimal CVs by minimizing the upper bound for the time-integrated memory kernels through parallel encoder networks. We demonstrate that MEMnets effectively identifies slow CVs involved in the folding of the FIP35 WW domain, revealing two parallel folding pathways. In addition, we illustrate MEMnets' robust numerical stability in identifying meaningful CVs in large biomolecular dynamic systems with limited sampling by applying it to the clamp opening of bacterial RNA polymerase, a much more complex conformational change.