Extending atomic decomposition and many-body representation with a chemistry-motivated approach to machine learning potentials

IF 18.3 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Qi Yu, Ruitao Ma, Chen Qu, Riccardo Conte, Apurba Nandi, Priyanka Pandey, Paul L. Houston, Dong H. Zhang, Joel M. Bowman
{"title":"Extending atomic decomposition and many-body representation with a chemistry-motivated approach to machine learning potentials","authors":"Qi Yu, Ruitao Ma, Chen Qu, Riccardo Conte, Apurba Nandi, Priyanka Pandey, Paul L. Houston, Dong H. Zhang, Joel M. Bowman","doi":"10.1038/s43588-025-00790-0","DOIUrl":null,"url":null,"abstract":"Most widely used machine learning potentials for condensed-phase applications rely on many-body permutationally invariant polynomial or atom-centered neural networks. However, these approaches face challenges in achieving chemical interpretability in atomistic energy decomposition and fully matching the computational efficiency of traditional force fields. Here we present a method that combines aspects of both approaches and balances accuracy and force-field-level speed. This method utilizes a monomer-centered representation, where the potential energy is decomposed into the sum of chemically meaningful monomeric energies. The structural descriptors of monomers are described by one-body and two-body effective interactions, enforced by appropriate sets of permutationally invariant polynomials as inputs to the feed-forward neural networks. Systematic assessments of models for gas-phase water trimer, liquid water, methane–water cluster and liquid carbon dioxide are performed. The improved accuracy, efficiency and flexibility of this method have promise for constructing accurate machine learning potentials and enabling large-scale quantum and classical simulations for complex molecular systems. A machine-learning-potential framework achieves an optimal balance of accuracy and efficiency through monomeric decomposition. Systematic evaluations highlight its potential in large-scale simulations of complex molecular systems.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"5 5","pages":"418-426"},"PeriodicalIF":18.3000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature computational science","FirstCategoryId":"1085","ListUrlMain":"https://www.nature.com/articles/s43588-025-00790-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Most widely used machine learning potentials for condensed-phase applications rely on many-body permutationally invariant polynomial or atom-centered neural networks. However, these approaches face challenges in achieving chemical interpretability in atomistic energy decomposition and fully matching the computational efficiency of traditional force fields. Here we present a method that combines aspects of both approaches and balances accuracy and force-field-level speed. This method utilizes a monomer-centered representation, where the potential energy is decomposed into the sum of chemically meaningful monomeric energies. The structural descriptors of monomers are described by one-body and two-body effective interactions, enforced by appropriate sets of permutationally invariant polynomials as inputs to the feed-forward neural networks. Systematic assessments of models for gas-phase water trimer, liquid water, methane–water cluster and liquid carbon dioxide are performed. The improved accuracy, efficiency and flexibility of this method have promise for constructing accurate machine learning potentials and enabling large-scale quantum and classical simulations for complex molecular systems. A machine-learning-potential framework achieves an optimal balance of accuracy and efficiency through monomeric decomposition. Systematic evaluations highlight its potential in large-scale simulations of complex molecular systems.

Abstract Image

用化学驱动的方法扩展原子分解和多体表示到机器学习潜能。
在凝聚态应用中,最广泛使用的机器学习潜力依赖于多体排列不变多项式或原子中心神经网络。然而,这些方法在实现原子能量分解的化学可解释性和完全匹配传统力场的计算效率方面面临挑战。在这里,我们提出了一种结合了这两种方法的方法,并平衡了精度和力场级速度。该方法采用以单体为中心的表示,将势能分解为具有化学意义的单体能量的总和。单体的结构描述符由单体和双体有效相互作用来描述,通过适当的排列不变多项式集作为前馈神经网络的输入来强制执行。对气相水三聚体、液态水、甲烷-水簇和液态二氧化碳模型进行了系统评估。该方法提高了精度、效率和灵活性,有望构建准确的机器学习势,并实现复杂分子系统的大规模量子和经典模拟。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
11.70
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信