Ruihan Zhou*, Ethan F. Bull-Vulpe, Yuanhui Pan and Francesco Paesani*,
{"title":"通过数据驱动的多体电位实现生物分子模拟的化学准确性:1 .气相聚丙氨酸。","authors":"Ruihan Zhou*, Ethan F. Bull-Vulpe, Yuanhui Pan and Francesco Paesani*, ","doi":"10.1021/acs.jctc.5c00474","DOIUrl":null,"url":null,"abstract":"<p >A predictive understanding of how proteins fold, misfold, and stabilize requires accurate molecular-level insights into the thermodynamic and kinetic forces shaping their backbones. While empirical force fields remain the workhorse of biomolecular simulations, their limited functional forms often fall short in capturing the complex many-body interactions that govern protein dynamics. Quantum-mechanical methods, on the other hand, offer high accuracy but are prohibitively expensive for large biomolecules. In this work, we introduce a generalized, intramolecular formulation of the data-driven many-body MB-nrg formalism that approaches “gold standard” coupled cluster accuracy in simulating polyalanine chains in the gas phase. By decomposing polyalanines into chemically intuitive building blocks, we develop modular and transferable potential energy functions that accurately reproduce reference energies, normal-mode harmonic frequencies, and conformational free-energy landscapes. Compared to empirical force fields commonly used in biosimulations, the MB-nrg potential energy function yields a smoother and more physically grounded free-energy surface, captures transient structural motifs under-represented by empirical force fields, and enables flexible sampling of secondary structure transitions in longer peptides. This work establishes a foundation for extending coupled-cluster-level modeling to larger biomolecular systems under physiologically relevant conditions, while highlighting the methodological challenges that remain in achieving consistent accuracy at scale.</p>","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"21 12","pages":"6194–6212"},"PeriodicalIF":5.5000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Toward Chemical Accuracy in Biomolecular Simulations through Data-Driven Many-Body Potentials: I. Polyalanine in the Gas Phase\",\"authors\":\"Ruihan Zhou*, Ethan F. Bull-Vulpe, Yuanhui Pan and Francesco Paesani*, \",\"doi\":\"10.1021/acs.jctc.5c00474\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >A predictive understanding of how proteins fold, misfold, and stabilize requires accurate molecular-level insights into the thermodynamic and kinetic forces shaping their backbones. While empirical force fields remain the workhorse of biomolecular simulations, their limited functional forms often fall short in capturing the complex many-body interactions that govern protein dynamics. Quantum-mechanical methods, on the other hand, offer high accuracy but are prohibitively expensive for large biomolecules. In this work, we introduce a generalized, intramolecular formulation of the data-driven many-body MB-nrg formalism that approaches “gold standard” coupled cluster accuracy in simulating polyalanine chains in the gas phase. By decomposing polyalanines into chemically intuitive building blocks, we develop modular and transferable potential energy functions that accurately reproduce reference energies, normal-mode harmonic frequencies, and conformational free-energy landscapes. Compared to empirical force fields commonly used in biosimulations, the MB-nrg potential energy function yields a smoother and more physically grounded free-energy surface, captures transient structural motifs under-represented by empirical force fields, and enables flexible sampling of secondary structure transitions in longer peptides. This work establishes a foundation for extending coupled-cluster-level modeling to larger biomolecular systems under physiologically relevant conditions, while highlighting the methodological challenges that remain in achieving consistent accuracy at scale.</p>\",\"PeriodicalId\":45,\"journal\":{\"name\":\"Journal of Chemical Theory and Computation\",\"volume\":\"21 12\",\"pages\":\"6194–6212\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-06-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemical Theory and Computation\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acs.jctc.5c00474\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Theory and Computation","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.jctc.5c00474","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
Toward Chemical Accuracy in Biomolecular Simulations through Data-Driven Many-Body Potentials: I. Polyalanine in the Gas Phase
A predictive understanding of how proteins fold, misfold, and stabilize requires accurate molecular-level insights into the thermodynamic and kinetic forces shaping their backbones. While empirical force fields remain the workhorse of biomolecular simulations, their limited functional forms often fall short in capturing the complex many-body interactions that govern protein dynamics. Quantum-mechanical methods, on the other hand, offer high accuracy but are prohibitively expensive for large biomolecules. In this work, we introduce a generalized, intramolecular formulation of the data-driven many-body MB-nrg formalism that approaches “gold standard” coupled cluster accuracy in simulating polyalanine chains in the gas phase. By decomposing polyalanines into chemically intuitive building blocks, we develop modular and transferable potential energy functions that accurately reproduce reference energies, normal-mode harmonic frequencies, and conformational free-energy landscapes. Compared to empirical force fields commonly used in biosimulations, the MB-nrg potential energy function yields a smoother and more physically grounded free-energy surface, captures transient structural motifs under-represented by empirical force fields, and enables flexible sampling of secondary structure transitions in longer peptides. This work establishes a foundation for extending coupled-cluster-level modeling to larger biomolecular systems under physiologically relevant conditions, while highlighting the methodological challenges that remain in achieving consistent accuracy at scale.
期刊介绍:
The Journal of Chemical Theory and Computation invites new and original contributions with the understanding that, if accepted, they will not be published elsewhere. Papers reporting new theories, methodology, and/or important applications in quantum electronic structure, molecular dynamics, and statistical mechanics are appropriate for submission to this Journal. Specific topics include advances in or applications of ab initio quantum mechanics, density functional theory, design and properties of new materials, surface science, Monte Carlo simulations, solvation models, QM/MM calculations, biomolecular structure prediction, and molecular dynamics in the broadest sense including gas-phase dynamics, ab initio dynamics, biomolecular dynamics, and protein folding. The Journal does not consider papers that are straightforward applications of known methods including DFT and molecular dynamics. The Journal favors submissions that include advances in theory or methodology with applications to compelling problems.